Attorney Do(^^o. 13403.0004NPUS00 



CERTIFICATE OF MAILING BY "EXPRESS MAIL" 

ExpressMai.LabeINo.:EL615M^ Date of Deposit: AugustLiOOi 

Ad^esL^'S^^^^ ^Znf:T:' T ^"'^^^ Service "Express Mail Post Office . 

Patents, Washington, D.C 2oSl ^ '''' ^ ^^^--d to: AssistaT^s^nerTor 



APPLICATION IN 




Karen K^^V^o^tp 



THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 

FOR 



COMPOSITIONS AND METHODS FOR GENERATING CHIMERIC 
HETEROMULTIMERS CHIMERIC 



INVENTORS: 



Caili Wang 

Judah St., Apt. 1 San Francisco, CA, 94122 
Citizenship: Peoples' Republic of China 

Pingyu Zhong 

5 Tyler Parkway, Mountain View, CA 94040 
Citizenship: USA 



Shengjiang Liu 

535 Devonshire Ct, Mountain View, 94043 
Citizenship: USA 



Peizhi Luo 

545 Dawn Dr., Sunnyvale, CA 94087 
Citizenship: USA 

Shengfeng Li 

1 14 Ladera Way, Belmont, CA 94003 
Citizenship: U.S.A. 



Xinwei Wang 

7221 Via Vico, San Jose, CA 95129 
Citizenship: Peoples' Republic of China 

HOWREY SIMON ARNOLD & WHITE, L 
750 Bering Drive, Houston, Texas 77057 
Attorney Docket No. 13403.0004NPUS00 



P: 95202(2 1G$0 1 !.DOC) 



1 



Attorney Doi^^n 1-,^ no ^ ^^^^nt 
0||p<fo. 13403.0004NPUS00 



This invention is in the fieM «f 

Antibodies or imm^gtow;^ are molecules 
*«ific cognate a„B™„, , *^"«»Sniz. and bind ,o 

^eam,e„, „f a variety of hn„,an diseases. 

'*=''--'™"nog,ob„,in„g,i„,^^^^, . 

.dentiea„ight(..L..,ehai„po,™epti,es,a • " """-^ °f <»« 

i^^iypeptides (approximately 23 irno\ ^ 
heavy (««») chain polypeptide, ■ ^' '^^^^^ical 

---.co.e„tdisn,.de,cr:::„T"^"^ 

"fOo-ains.^eLcba.nbast.odo.ains! ^ 
•'"•^ertotbeVre,„„c.VL-,rr; 

V region rv„., ^ ^^^^^ '^^^'^ T"' '<> 
«'ibody contains ,„„a„„,,. i„ fte C region. The 

'"''™'^™-^^^'^--).eachofwbicbbasaV. 
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Uie Fd fragment consisting of die VH anrt rm ^ ■ 

E S « , „ 8 «•= VH and CHI domatns; („i) the dAb fragment (Ward 

r"°""'*^^''"'^*~ = «-"''>^---pHsingtwoPa. 

-gn,ents,in.ed..adis„.de«dgeatd,e.inge.gio.and(vi).,ePvfragment 

oo.-s.n,gof.aeV.a„dV„d„m.nsofasi„g,ea™ofana.t.od.T.epZent 
-esma„est^etiona,„.t^„iredforMg,affini..«ndin.ofantigen 

repe«o2""""""'*^""'^""^^'=^"'°— 
~of™„„og,„,.i„.,a.mi„,,,,,^„„„^„^^,^__^^,^^^__^ 

«™™es.s.em.Snc.a„egene.,,,,asacomp,e.i.,™^ 

^'"'"^"'^''^f^ stabilizing means for assembly of the 
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d.s.ocia,e a. low protein co„ce„.a«o„s and are .„„ uns.ab,e tor n.^y app„ca«„ns a. 
physiCogioa, body ,en,pera,„.. „ , a,so a ,on,-reco^i.ed .echneal obs,ac,e fta. 
large proteins, such as whole anybodies (aibei. ex.e„ely s«e,, do no. exp,«s a. an 
appreciable ieve. in U,e hos, ce„. tas rendering u,e cons^rcion of a highiy diverse 
antibody repertoire very difficult. 

More recently, ti^ee approaches have been developed to ,e„e,.,e stable VL and 
VH complexes. However, each of these ,echr,i,„es bear, a nun,ber of intrinsic 
talons; and none of them events the aforementioned technical hurdles 

completely. The first approach uses a peptide linker to comtec. the VL and VH as a 

amgle-ch^n ("scFV, (Huston „98S) P,^. 

Whtle the resuldng scFv exhtbits substantial antigen-binding activity, not al, antibodies' 
can be made as single chains and still retain high bindmg afiinity (Huston « (,9S8) 
Pjc. Na,, .caa. Sci. U.S.,. 85:5879-5883; S.emmer (1993,.^,.*„„.. 
256-265, b par,, this is due to the interf^nce of linker sequences with the .tigen 
binding sites. The second approach involves inserting a pair of cysteine residues in d,e 
VL and VH regions to generate a disulfde-bond stabilized Fv ("dsFv") (Brinlonam, e, 
^>.<^m,Pro.. Aca,. Sc. U.S.A. 90(16): 7538-7542, The incorpo^ted disulfide 
■.nkage. however, is unstable u,der reducing conditions in many hos. cells For 
.ns^ce. in cy.osol of ^. Co«. «,e in.er-molecular disulfide bond is often insufficien. ,o 
stabilize Ure VL and VH complex. Moreover, .bis meti,od .ypically requires 3- 
d-mension^ sti™.ural infonna.ion of the V regions to ensure that the cysteine pair is 
inserted m a pt^per place without disruption the binding activity Because the 3- 

dtmensionalinformationofavastmajorityoftheexistingantibodiesism^nownUus 
app^ch has litfie practical utility, and is particularly unsuited for antibody library 
constiuction. especially for constntcting antibody repertoires derived ftom B cells The 
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Wrd approach for slabifeing a,, VL and ™ ■ 

-cH,a„dc.o...o.c:r::rrrj'^^^^^^^^^^^^ 

expression and antibody reperto.V, °' """"^ "^h level 

CerWn.- . ^"'"^'""''-'"i's large size, 
l-enam dimenzation sequence., ,h., r 

™P'oyedioassen,b,en,.«J„, r; °™'°''*^°''---'>-a,sobeen 

--.abispee.c,,br"i: 

;2 neterodimer linked hv th^ i? 
'^^^o-d.„„,e„ci„ezippe.3,,,^,,.^j;''7-»<'-""'«in«,ppe.. 

~..onn,e.erod„„.,.„j;^^^^^^^^^ 

«>™bo„odi™e. ^derpbysioiogica, bnffj T P""'-^'^ '° 

'~e<o..ea«..„,;:: 

wiWi-mercaptoethanylaminerseeTTQ d . 
co'unn, 7 ,ines 35-3, Pa,e„. No , ,32 443 

"'"V..bo«,^„.™<,^„„^^,^ . ^•^«--'"™>«Itaes,5-30). When,,,,^ 

--3o.us.pa.e„;,:3::::~^^ 

While fte existence of o T' 

--"-oncenrforZL 

lor me production of a sinele antiK^^ 

P-.saseno„sp„b,e„,foran,hodyrepe„r 

he.erod,a,ertza,io„be^een VLand i ""-'^Sh e&eneyof 

nvi. and VH regions is required. 

A^.defto™F<„a„dy™,e„ci„e^„_„,_ 
« describes d,e co„sm,c«on of a J ^ "-'O" 
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coneys fte se,ec„„„ of ..a„,ige„ pepHde.- bind ,„ .argeted a„Hb„.- . ' 

01 aescnbe or even suggest the use of coiled-ooiT 
sequences to generate stable antigen-bMng units 

.etbodsT:"'^ 

2*"as.oge„erat,s.ab,eantigen.b,„d,g„„,sandrep«^^^ 

5iaDietl,anaFvftagment,bntwouldpreferabtvh, „ , 
'0 allow la^e-scalenmH '"**'^'^™^"=^>h«naFabfrag„,en, 

a^r "'^^'"^"'"'"^^^^''-"-- — relatedadvan.ges 



speci«e?T""'"*"'"^"'~'^*^^--°'-«'»..-for 
J-«casse„bl,of_rlcpol^eptldes.ofo™,as.ablel,eteron,ultilr;, 

hefemm,.u- genetically diverse repertoire of 

heteromultimers such as antigen-binding units Th.t . • 

avarietyof-geneticnac. ^^^^^^ readily be adapted to 
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w uuy j 1 1 236, and WO09708320 
single-Chain antibodies (scFv, T. ■ 

anUbod ,„ ' * ™'"''^="-''"*"8-'sP-«cular,ys„i,edf„, 

antibody hbra,ycomlmctio„ and display Certain ™nfi„ 
binding uni, serve as convenien, ,Z 

convement bmld,„g >„„,s for muWvalem and mullispecific 
immunoglobuUns. "raspecilic 

-co;r;;::::rr^^ 

.g.on<,lrot "'"'^'''""~-''^'<^'*^ variable 

son ftsed ,0 a first heterodimerization sequence; (b) a heavy mi .h ■ , 
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In another aspect, the present invention provides a non-single-chain antigen- 
binding unit comprising: (a) a light (L) chain polypeptide comprising a light (L) chain 
variable region fused to a first heterodimerization sequence; (b) a heavy (H) chain 
polypeptide comprising a heavy (H) chain variable region fiised to a second 
heterodimerization sequence; wherein the L chain and the H chain polypeptides 
dimerize via pairwise affinity of the first and second heterodimerization sequences 
which are derived from heterodimeric receptors. In one aspect, the first and second 
heterodimerization sequences comprising heterodimerization receptor sequences that 
mediate heterodimerization of the receptors, hi yet another aspect, the subject 
heterodimerization sequences form a coiled-coil dimer. In still another aspect, the L and 
the H chain polypeptides dimerize via non-covalent pairwise affinity of the two 
heterodimerization sequences. Preferably, the L or the H chain polypeptide fiirther 
comprises a flexon that is flanked by the variable region and the heterodimerization 
sequence. Both the L and H polypeptide sequences may be derived from hmnan L and 
H chains. To further stabilize the heterodimeric Abus, cysteine residues can be 
introduced to provide disulfide bonds between the first and the second 
heterodimerization sequences. The non-single-chain antigen-binding units may be 
monovalent or multivalent. They may be monospecific or multispecific. Prefeired 
multispecific Abus are bispecific, trispecific and tetraspecific molecules. 

In a separate embodiment, the present invention provides a single-chain antigen- 
binding unit comprising a light (L) chain variable region and a heavy (H) chain variable 
region comiected by a first and a second heterodimerization sequence spamiing the 
distance between the C-terminus of one of the region to the N-teiminus of the other 
region, wherein the two regions form an intra-molecular dimer via pairwise affinity of 
the first and second heterodimerization sequences; and wherein at least one of the 
heterodimerization sequences is essentially incapable of forming a homodimer under 
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physiological buffer conditions and/or at physiological body temperatures. Preferably, 
both of the first and second heterodimerization sequences are essentially incapable of 
forming homodimers under physiological buffer conditions and at physiological body 
temperatures. 

In another aspect, the present invention provides a single-chain antigen-binding 
unit, wherein the VL and VH regions form an intra-molecular dimer via pairwise 
affinity of the first and second heterodimerization sequences which are derived from 
heterodimeric receptors. In one aspect, the first and second heterodimerization 
sequences comprising heterodimerization receptor sequences that mediate 
heterodimerization of the receptors. 

hi yet another aspect, first and second heterodimerization sequences form a 
coiled-coil dimer. hi another aspect, the first and second heterodimerization sequences 
dimerize via non-covalent pairwise affinity. Both the VL and VH regions can be 
derived fi-om the corresponding sequences in a human L and H chains, respectively. 

Both the non-single-chain and single-chain antigen-binding units can be 
conjugated to a chemically fimctional moiety. Exemplary functional moieties include 
but are not limited to signal peptides, agents that enhance immunologic reactivity, 
agents that facilitate coupling to a soUd support, vaccine carriers, bioresponse modifiers, 
toxins, detectable labels, paramagnetic labels, and drugs. 

Rt^feired heterodimerization sequences contained in the subject antigen-binding 
units are derivedli=ollr«4gQi^^ of GABAb receptor 1 and GABAb receptor 

2, respectively. More preferablj^ih^^rsHiete^^ sequence comprises a 
polypeptide of at least 30 amino acid residues that iT^^^JltiaH^cidenticd to a linear 
peptide sequence of comparable length depicted in SEQ ID NO. 2; a^dtlJe^ecmid 
heterodimerization sequence comprises a polypeptide of at least 30 amino acid r^id^I^ 
that is essentially identical to a linear peptide sequence of comparable length depicted in 
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polypeptide ofat least 30 
peptide sequence of comp 
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SEQ ID NO. 4. AltematUly, the first heterodmerization sequence comprises a 

amino acid residues that is essentially identical to a linear 
•able length depicted in SEQ ID NO. 4; and the second 
.eterodimerization sequence comprising a polypeptide ofat least 30 amino acid residues 
that is essentially identical tl a linear peptide sequence of comparable length depicted in 



SEQ ID NO. 2. 



The present invention provides a recombinant polynucleotide comprising a 
coding sequence that encodes the L and/or H polypeptide of a non-single-chain antigen- 
hndmg unit. The invention also provides a recombinant polynucleotide comprising a 
coding sequence that encodes the VL or VH regions of a single^hain antigen-binding 
untt. Also p„,vided is a vector comprising any one of the recombinant polynucleotides 
described herein. ITre vector can be an expression vector, e.g. a phage display vector 
Further provided in this invention is a selectable library of exp,.ssion vectors encoding 
a repertoire of antigen binding units, comprising more than one subject vector. 
Preferably, the selectable library comprises a plurafity of phage display vecto,.. 

The present invention also provides a host cell comprising the subject 
recombinant polynucleotides. The recombinant polynucleotide encoding the L chain 
polypeptide and the polynucleotide encoding the H chain polypeptide may be present in 
a smgle vector or in separate vectors. The host cell may be eukatyotic or prokaryotic. 

to yet another embodiment, the ptesent invention provides a mediod of 
Ptoducing a non-single-chain antigen-binding unit. The method involves flte following 
steps: (a) expressing in a host cell a flrs. recombinant polynucleotide encoding a light 
(L) chain polypeptide comprising a light (L) chain variable region fltsed to a Rrsi 
hetetodimerization sequence, and a second recombinant polynucleotide encoding a 
heavy (H) chain polypeptide comprising a heavy (H) chain variable region fused to a 
second heterodinterization sequence; wherein the L chain and fl>e H chain polypeptides 
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dimerize via pai™ise affi^.y ot„. fi., and second he.er„dimeriza,i„„ sequences; and 
wherem a. leas, one of the heterodimerizadon sequences is essenMy incapable of 
forming a homodimer under physiological buffer conditions and/or a. physiological 
body .empc^ures; and optionally (b) isolating U,e antigen-binding unit expressed in the 

host cell. 

The produced antigen-binding unit may also contain hetetodimerizaUon 
sequences that are derived fron, heterodimeric receptors. Additionally, the non-single- 
chatn antigen-binding expressed in step (a) can be displayed on surface of the host cell 
P^ferably. the non-single-chain antigen-binding expressed in step (a) is displayed on a 

phage particle. 

In still another embodiment, the present invention provides a method of 
pmducing a non-single-chain antigen-binding unit, the method comprises the steps of 
(a) preparing a first recombinant pol^ucleotide encoding a light (L) chain polypeptide 
comprising a light (L) chain variable region fused to a first hetetodimerization sequence 
and a second recombinant polynucleotide encoding a heavy (H) chain poljpeptide 
comprising a heavy (H) chain viable tegion ftsed to a second heterodhnerization 
sequence; wherein tire L chain and tite H chain polypeptides dimerize via pairwise 
afnmty of tite first and second hetetodimerization sequences; and whcin at least one of 
«>e heterodimerization sequences is essentially incapable of fonning a homoduner under 

Phystological buffer conditions and/or at physiological bodytemperatutes; and (b) 
allowing fl,e first and second polypeptides to dimerize via pai^se ailtoity of the first 
and second heterodimerization sequences. The step of dimerization may take place /„ 

vitro or in vivo. 

This invention also includes a method of ptoducing a single-chain antigen- 
bmding „mt. The metitods involves the steps of (a) expressing m a host cell a 
polynucleotide comprishtg a coding sequence that encodes flte subject single-chain 
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antigen-binding unit; and optionally (b) isolating the single-chain antigen-binding unit 
expressed in the host cell. 

This invention further includes a method of displaying a chimeric 
heteromultimer comprising at least two polypeptides on a surface of a host cell. This 
method comprises expressing in the host cell (i) a first recombinant polynucleotide 
encoding a first polypeptide fused to a first heterodimerization sequence and a surface 
presenting sequence; (ii) a second recombinant polynucleotide encoding a second 
polypeptide fused to a second heterodimerization sequence; wherein the first and second 
polypeptides dimerize via pairwise affinity of the first and second heterodimerization 
sequences; wherein at least one of the heterodimerization sequences is incapable of 
forming a homodimer under physiological buffer conditions and/or at physiological 
body temperatures. In one aspect, the first and second polynucleotides are expressed by 
a single phage display vector. In another aspect, the first and second polynucleotides 
are expressed by separate phage display vectors. The chimeric heteromultimer is 
preferably a non-single-chain antigen-binding unit of the present invention. 

The invention also encompasses a method of identifying a non-single-chain 
antigen-binding unit that is immunoreactive with a desired antigen. The method 
comprises the steps of: (a) preparing a genetically diverse repertoire of antigen-binding 
units, wherein the repertoire comprises more than one subject antigen-binding unit; (b) 
contacting the repertoire of antigen binding units with the desired antigen; and (c) 
detecting a specific binding between antigen binding units and the antigen, thereby 
idenfifying the antigen-binding unit that is immunoreactive with the desired antigen. In 
one aspect of this embodiment, the repertoire of antigen-binding units is prepared by 
expressing a library of vectors encoding a plurality of the antigen-binding units. 
Preferably, the library of vectors comprises a plurality of phage vectors. 
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Finally, the present invention provides a kit comprising a vector of this invention 
in suitable packaging. 

EXPLAINATTON OF ARRRF.VT/^tIONS TTSFn MPPTijtxt 

1. Nsc: Non-single chain 

2. Sc: Sing-chain 

3. Abu: Antigen-binding unit 

4. Abus: Antigen-binding units 

4. L chain: Light chain 

5. H chain: Heavy chain 

6. VL: Light chain variable region 

7. VH: Heavy chain variable region 

BRffiF DESCRTPTTOM n v THE PR AWTNHs; 

Figure 1 is a schematic representation depicting various antigen-binding units. 
2 depicts the nucleotide and amino acid sequences of GABAb receptor 1 
and 2 that '^^^r^in constructing the subject Abus. The coiled-coil sequences are 
derived from hum^iS^BAB-Rl and GABAb-R2 receptors. The coding amino acid 
sequences from GABAb reJ^^to^^Tbegi^ with EEKS and ends with QLQS as shown in 
the top panel (SEQ ID NO. 2). The^^Steg^ino acid sequences from GABAb receptor 
2 begins with TSRL and ends with QLQD ^"^^bvoim the bottom panel (SEQ ID 
N0.4). A flexible SerArgGlyGlyGlyGly spacer waT^^a^ the amino-termini of Rl 
and R2 heterodimerization sequences to favor the formationihwtional Fv 
heterodimer. To further stabilize the heterodimer, we have introdiWVA(alGlyGlyCys 
spacer to lock the heterodimeric coiled-coil pair via the disulfide bond bet^^Sthe 
cysteine residues. The SerArg coding sequences at the N-terminus of the GGGG^^^ 
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U providesXbalorXhoIsitesforfusionofjheGRl (heterodimerization sequence derived 
/ ^ 1^ from GABAal receptor) and GR2 (heterJdimerizat:on sequence derived from GABA32) 

domains to fhe carboxyl-tennini of VH a|d VL fragments, respectively. 
38 Figure 3A is a schematic representation of two expression vectors pABMXl and 

PABMX2. Both pABMXl and pABMX2 were derived from pbluescript SK(+) 
comprising an ampiciUin-resistance gene (Amp) for antibiotic selection, a plasmid 
origin of rephcation (colEl ori), a fl phage origin of replication (fl ori), and lac 
promoter/lac 01 driven protein expression cassette (plac-RBS-p8Leader-DH tag for 
PABMXl, plac-RBS-pelBLeader-DHtagforpABMX2). The heterologous sequence 
is expressed as a DH- tag (HA and 6 x His tag) fission protein, and is directed by the 
signal peptide either p8 leader or pelB leader into periplasmid space, where the leader 
sequence is cleaved 

Figure 3B depicts the sequences (SEQ ID NOS. 5-8) after the lac promoter 
between Agel and Bglll sites of pABMXl and pABMX2. The Hindlll/Xbal or 
Hindlll/NotI or Xbal/NotI sites can be used for inserting heterologous sequences that 
are to be expressed in pABMXl vector. Additional cloning sites included in the 
pABMX2 vector are Ncol, PstI, Xbal and Notl. 

Figure 4A is a schematic representation of the phagemid vectors pABMDl and 
PABMD2 usefiil for displaying antigen-binding units. pABMDl and ABMD2 were 
derived from PABMXl and pABMX2 respectively. They comprise all of the functional 
elements of pABMXl and pABMX2 vectors, and pin gene of a filamentous phage. The 
PIII gene was inserted immediately adjacent to the 3' end of DH-tag. The lac promoter 
drives expression of a heterologous sequence as pHI capsid fission protein, which in tuxn 
is displayed on a phage particle upon superinfection of a helper phage such as K07 
(Amersham Pharmacia Biotech) or R408 helper phage (Stratagene). This vector can 
also be used for soluble protein expression in a non-suppressor bacterial strain. 
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Figure 4B depicts the sequence (SEQ ID NOS 9 .,, , 

. ^ ^ ^"'2) after the lac promoter 

between Agel and Sail sites of pABMDl and pABMD2. 

Figure 5A is a schematic representation of the vectc. pABMX5 and pABMX6 

PABMX.a„dABMX.werede„ved«™.p^MX,a„dp„respect.i 
Dtfferent leader sequences were incor^rated into pABMXS and pABMX6 The 

»bc.oni„gsitesf„rinsert.onofhetero.ogousse,„e„ce,e.,V„gene.a,sod,^ 

h^e two vector.. pABMX5co„tainsthep8,eader,andp„containsthepe,B 

^ader.Twoproteinexpressio„easse.tese.p,oy„,g,,e,acpro„oterwere»^^^^^^ 
.nto these two vector. Tlre first cassette is used to express VH-GRl (VH 

(VL- heterodimerization sequence of GABA. receptor 2, The DH-tag was 
fcsed to GR2 domain to facilitate purification of the resulting heterodimer. 

Ftgure 5B depicts the sequences (SEQ m NOS. 13-16) het„e«, the leader 
sequencea„dD„..ag,np^MX5andpABMX6vecto,..h,addi«on,theHhosonre- 

taiir "'^^ - 

..JT" ' ' — • "ABMDS and 

™,wMchareusefi.lforexpressinganddispla,„gccFvo„aphagepartic,e. 

pABMB5andABMO.„erede.vedfiompABMX5a„dpABMX.respective,,pn: 
*™ede„.cdi^om«,efilan.entousphagewasinse„edinunediate,y^^^ 

neterodimer. 

Fi^re 6 B depicts the sequences (SEQ U, NOS. 17-20) between the leader 

;.7r"^'"^°''^"''^™^''^------«on,Hhosomebtndi„g 

m-tag. partial p,n , subcloning sites for insertion of VH, VL, GRl and GR2 are 
also indicated. ' 
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yeast. 



Figure 7 depicts the vector pAMEX7 useful for expressing ccFv fragment in 



Figure 8 depicts the results of an ELISA assay using AM2-scFv fragments that 

wereexpressedbythepABMXl vector. TT^e results show a dose-dependent AM2-scFv 
binding to its antigen AM2. 

Figure 9 depicts the results of an ELISA assay using AM2-scFv fragments that 
were displayed on phage particles. The results demonsfrate the assembly of functional 
scFv fragments on phage particles using the phagemid vector pABMD 1 . 

Figure lOA depicts the results of SDS-PAGE analysis of AMl-ccFv expressed 
in E. coH under reducing and non-reducing condition. The results demonsfrate 
successfiil expression and assembly of heterodimeric ccFv in E. coli. 

Figure lOB depicts the results of an ELISA assay using soluble AMl-ccFv 
expressed in E. coli. n.e results indicate successful assembly of fi.nctional ccFv with 
expected binding specificity to its corresponding antigen. 

Figure 1 1 A depicts a comparison of the antigen binding capability of AMl-ccFv 
expressing phage and that of AM 1 -scFv expressing phage. The results demonsfrate that 
the phage particles displaying the AMl-ccFv fragments exhibit slightly higher binding 
capability than the phages displaying conventional scFv fragments. 

Figure 1 IB depicts a comparison of the antigen binding capability of AM2-ccFv 
expressing phage and that of AM2-scFv expressing phage. The results indicate that the 
binding capability of the phage particles displaying the AM2-ccFv fragments is about 
one order of magnitude higher than that of the AM2-scFv expressing phages. 

Figure 12 depicts three multivalent Abu configurations, each comprising more 
than one basic ccFv unit. 

Figure 13 depicts four bivalent Abu configurations, each comprising a basic 
ccFv unit and a scFv or a dsFv fragment. 
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Figure 14 depicts three trivalent Abu configurations, each comprising one or 
more basic ccFv unit, one or more scFv or dsFv fragment. 

Figure 15 depicts four bispecific Abu configurations, each comprising one or 
more basic ccFv unit with distinct binding specificities, and/or a scFv or dsFv fragment. 

Figure 16 depicts three additional bispecific Abu configurations. 

Figure 17 depicts three trispecific Abu configurations, each comprising at least 
one basic ccFv unit, and at least one scFv or dsFv fragment. 

Figure 18 depicts two exemplary single Abus, in which the heterodimerization 
sequences are arranged in either a parallel or anti-parallel configuration. 

Figure 19 is a schematic representation of ccFv displayed on the surface of a 
prokaryotic or eukaryotic cell. The top panel depicts ccFv displayed by a phage particle 
that is adhered to the surface of a host cell. 



MODEfSl FOR CARRYING OUT THE INVENTION 

Throughout this disclosure, various publications, patents and published patent 
specifications are referenced by an identifying citation. The disclosures of these 
publications, patents and published patent specifications are hereby incorporated by 
reference into the present disclosure. 

General Techniques: 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of immunology, biochemistry, chemistry, molecular biology, 
microbiology, cell biology, genomics and recombinant DNA, which are within the skill 
of the art. See, e.g., Matthews, PLANT VIROLOGY, 2,'^ edition (1991); Sambrook, 
Fritsch and Maniatis, MOLECULAR CLONING: A LABORATORY MANUAL, 2""^ 
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edition (1989); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. 
Ausubel, et al. eds., (1987)); the series METHODS IN ENZYMOLOGY (Academic 
Press, Inc.): PCR 2: A PRACTICAL APPROACH (M.J. MacPherson, B.D. Hames and 
G.R. Taylor eds. (1995)), Harlow and Lane, eds. (1988) ANTIBODIES, A 
LABORATORY MANUAL, and ANIMAL CELL CULTURE (R.I. Freshney, ed. 
(1987)). 

As used in the specification and claims, the singular form "a", "an" and "the" 
include plural references unless the context clearly dictates otherwise. For example, the 
term "a cell" includes a plurality of cells, including mixtures thereof 

Definitions: 

The terms "polypeptide", "peptide" and "protein" are used interchangeably 
herein to refer to polymers of amino acids of any length. The polymer may be linear, 
cyclic, or branched, it may comprise modified amino acids, and it may be interrupted by 
non-amino acids. The terms also encompass amino acid polymers that have been 
modified, for example, via sulfation, glycosylation, lipidation, acetylation, 
phosphorylation, iodination, methylation, oxidation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, transfer-RNA mediated 
addition of amino acids to proteins such as arginylation, ubiquitination, or any other 
manipulation, such as conjugation with a labeling component. As used herein the term 
"amino acid" refers to either natural and/or unnatural or synthetic amino acids, including 
glycine and both the D or L optical isomers, and amino acid analogs and 
peptidomimetics. 

A polypeptide or amino acid sequence "derived from" a designated protein refers 
to the origin of the polypeptide. Preferably, the polypeptide have an amino acid 
sequence that is essentially identical to that of a polypeptide encoded in the sequence, or 
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a portion thereof wherein ihr. 

wneremtneportionconsistsofatleastlmn. • 

Ism 20-30 amino aniH. « 
''™™"^^-™-P-f-b,ya.,ea3t30-50an,inoacids,orwhichis 

~'""-^-«-'-'^-'-P..dee„co.e.inthese,ne„ce™ 

r:r^^-'"--"----™a.je.n.eir. 



A chimenc Protein contains at least one fusion nni 

in a different position in the ^^S'^'"^ 
position in the sequence than that occurs in nature Th. ■ 

nonn.,.e..tin.epa..epro.einsan.areB™,h.,o.eJ:Lr-"^^ 

-y.a.norn,a,,e...„,hesa™epro.eh,h„.!eZra 
feion polypeptide A Chi • "^w arrangement in the 

f jpepnae. A chtmenc prote n may be creafM r™ 

synthesis, or by crea,in.anH, , • ' '""P'^' *™ical 

"y creating and translating a pojynucleolidem which the nentiH • 
are encoded in the desi,^ relationship. 

mo.thlr''^''^"'*'"""''""^'^'"^''*'"-^"— g 

^jr:::— :::::::: 

-..n.«.cpro.emmayalsobecomposedofmoretha„o„epo,^eptideofd.sti:t 
~.ofo.,„a.eteromn,.mer....„s,a..heteromnltim;is^.^ 

p::::r.^"'^"'-^^---"-'-----i 

-^n:;:::""^"™"^''°"'^--'-'----.-to^^ 

fetand "^'■""°™"'-='^™-"P'^-"hete„dime..fo.™edby.he 
^-^^secondpolypeptideorcanfot^higherordertettiarystntctureswheremore 

oligomeric structures. ' ^^""^^ 
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The "first recombinant polypeptide" of a chimeric heteromultimer refers to any 
polypeptide which is or was associated with a "second recombinant polypeptide" via the 
pairwise affinity of two dimerization sequences that are linked to the first and second 
polypeptides, respectively. Preferably, the first and second polypeptides contain 
sequences derived fi-om a hght or a heavy chain of an immunoglobulin. More 
preferably, the first and second polypeptides forni a Nsc Abu that confers binding 
specificity to a desired antigen. 

A "6rsl heterodimerization sequence" refe« .„ any dimerization sequence which 
is orwas associated with a "second hete^dimerization sequence." wherein the second 
heterodimerization sequence differs in amino acid sequence by a, least one amino acid 
residue. A "heterodimerization pair" refers to two heterodimerization sequences 

capable of forming a heterodimer. 

The tenn "antibody" as used herein refers to immunoglobulin molecules and 
inm,nnol„gically active portions of immunoglobuhn molecules, i.e., molecules that 
contain an antigen-binding site which specifically bmds ("immunoreacts with") an 
antigen. Structurally, the simples, naturally occurring antibody (e.g., IgG) comprises 
four polypeptide chains, two heavy (H) chains and two light (L) chains inter-connected 
by disulfide bonds. The immunoglobulins represent a large family of molecules that 
include several types of molecules, such a. IgD, IgG, IgA, IgM and IgE. The tenn 
"immunoglobulin molecule" includes, for example, hybrid antibodies, or altered 
anfibodies, and fragments ther^f. ,t has been shown a,at fl,e antigen binding fbnction 
of an antibody can be perfom,ed by fragments of a naturally-occurring antibody. Tirese 
fragments are collectively temred "antigen-binding units" ( "Abus"). Abus can be 
broadly divided into "single-chain" ("Sc") and "non-single-chain" ("Nsc'O types based 
on their molecular structures. 
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Also encompassed within the terms "antibodies" and "Abus" are 
immunoglobulin molecules of a variety of species origins including invertebrates and ■ 
vertebrates. The term "human" as applies to an antibody or an Abu refers to an 
immunoglobulin molecule expressed by a human gene or fragment thet^f The term 
"humanized" as appUes to a non-human (e.g. »de„t or primate) antibodies are hybrid 
immunoglobulins, immunoglobulin chains or fragments thereof which contain minhnal 
sequence derived from non-human immunoglobulin. For the most part, humanized 
antibodies are human immunoglobulins (recipient antibody) in which residues ih>m a 
complementaty detemrining region (CDR) of the recipient are replaced by residues from 
a CDR of a non-human species (donor antibody) such as mouse, rat. rabbit or primate 
having the desired speciflcity. affmity and capacity In some instances. Fv framework 
region (FR) residues of the human immunoglobulin are .placed by cotresponding non- 
human residues. Furthennore. the humanized antibody may comprise residues which 
are found neither in the recipient antibody nor in the imported CDR or framework 
sequences. These modifications are made to fltrther refine and optimize antibody 
performance and minimize immunogenicity when introduced into a human body. In 
general, the humanized antibody will comprise substantially all of at least one, and 
typically two, variable domains, in which all or substantially all of the CDR regions 
correspond to Utose of a non-human hnmunoglobulin and all or substantially all of flre 
FR regions are those of a human immunoglobulin sequence. The humanized antibody 
may also comprise at least a portion of an immunoglobulin constant region (Fc). 
typically fliat of a human immunoglobuUn. 

"Non-single-chain antigen-binding unit" ("Nsc Abus") are hetemmultimers 
comprising a light-chain polypeptide and a heavy-chain polypeptide. Examples of the 
Nsc Abus mclude but are not limited to (i) a ccFv fragment (Figure 1) stabilized by a,e 
heterodimerization sequences disclosed herein; (ii) any oUrer monovalent and 
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muUivalen, „„,ec„,es eo^pHsin, a. leas, one ccFv fta^en, as descHbed herein- (iii, 
an Fab fi^j^en. consis.i„s of *e VL, VH, CL and CH, domains; (iv) an Fd fra^en, 
— « Of tt,e VH and CH, domains; (v) an Fv frag„,en, cons,seins of .he VL and 
VH d„„a,n3 of a single arm of an anUbody; (vi) an F(ab-)2 fta8„,en,, a bivalen, 
fragmen. compHsing «vo Fab fra^en. Hnked by a disulfide bridge a, d,e htage region- 
(vn) a d,abody; and (viii, any od,er Nsc Abus m are described in U„,e « al. (2000, ' 
Immunology Today. 

As noted above, a Nse Abus can be either "monovalem" or "multivalent " 
Whereas the fonner has one binding site per antigen-binding untt. the latter contains 
mulfple binding sites capable of binding to more than one andgen of the same or 

d.ife,™„md. I>«P»dingon.he„umberofbhrdingsites, aNsc Abusmaybebivalent 
(havtng two antigen-binding sites), trivalent (having three antigen-bindtng sites, 
tetravalent (having four antigen-binding sites,, and so on. 

Multivalent Nsc Abus can be teher classified on fl,e basis of their binding 
specificities. A "monospecific" Nsc Abu is a molecule capable of binding to one or 
more antigens of the same Irind. A -^ultispecific" Nsc Abu is a molecule having 
bmding specificities for at least two different antigens. While such molecules normally 
wtll only bind two distinct antigens (i.e. bispecific Abus), antibodies with additional 
spectficities such as trispecific anfibodies ate encompassed by this expression when 
used hetein (see, e.g. Fig.es 15-17). Examples of bispecific »figen binding units 
mclude those with one ann directed against a tumor cel. antigen and the other arm 
dtr^ted again.acyto.oxic trigger molecule such as anti-FcyRl/anti-CDlS. anfi-p,S5 
/FcyRln (CD16,, an.i-CD3/anfi-malignan. B-cell (IDIO), a„.i-CD3/=u,M-p,85 

anMD3/anfi-p97.an.i-CD3/anfi-renal cell carcinoma. an,i-CD3/an«-OVCAR-3 anfi ' 
CD3/L-D, (anti-colon carcinoma,. anfi-CD3/anti-melanocyte stimulafing hormone 
analog. anfi-EGF recep.or/an«-CD3, a„ti-CD3/a„fi.CAMAl, a„.i.CD3/an.i-CD,9 anfi- 
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enzytne acttvate. p.„,„, ^ anti-CD30.a„ti-anca,„e p,„.p,atase (wWc, 

Abus whtch can ,e used as a™„,^, ^^.^ ^ 

P..~..no.e„ac«va,„na>A),a„t.fi.„.an.,^^^^^^^^ 
m,; „ a„ti,e„.«„^„, ^^^^ 

^cepto. auch a. a„ti.,„w density ,ip„p„,e.„ (LDL,/a„,,,^ 

^-..P.Kn...,e..A.s.„..n.tapy„,......,^^^^^^ 

an .-Cm/antt-hetpes ..p,ex vi..s O^SV), anti-T-ceU .cep,„,:CD3 co.p,ex/a„,, 

.nf-uenza, anti.Pc,R/ant.„:V; ^.peeae ABus ro. tt^o. detect™ . w« o.„ „^ 
sucha.3n.i.CEA/ant..B0TUBE,a„,CBA.a„,,p,,,^ 

BsA^aa vaccine adinvants,eePan,e.«.,.„,..,^,,,^^,.^,,_ ; 
oo.a„c.asa„ti.««:,o.a„..,^,,,„,,„^^^^^^ 

-™onea„ti_tat.n,a„,^^^^^^^^ 
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JfiMitigen-binding unit" C"Sr Ahii"^ r-^f^ * 
. , e uiui ^ j»c Abu ) refers to a monomeric Abu 

A..hougHt.etw„d„n,ai„s^?5Sp,,5,,^^ 
s^aetlc ,i„,e. can ,e ntade that enables the„ .o;:;;;2r^r^i^ « e 

™..echainPvCV.v.,asdescHbedi„B.d«„,09SS,.c.„ce.Zi;;^^ 
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Huston et al. (1988) PA^^^ 85:5879-5883) by recombinant methods. Other Sc Abus 
include antigen-binding molecLles stabilized by the subject heterodimerization 
sequences (see e.g. Figure 18), and dAb fragments (Ward et al, (1989) Nature 341:544- 
546 ) which consist of a VH dlain and an isolated complimentarity detennining region 
(CDR). An example of a hnkinl peptide is (GGGGS)3, which bridges approximately 
3.5 nm between the carboxyl teiinus of one V region and the amino tenninus of 
^ another V region. Other linker sLences can also be used, and can provide additional 
functions, such as a means for attJbhing a drug or a sohd support. A preferred single- 
chain antigen-binding unit contain! VL and VH regions that are linked together and 
stabilized by a pair of subject heterAlimerization sequences. The scFvs can be 
assembled in any order, for example! VH-<first heterodimerization sequence).(second 
heterodimerization sequence) -VL ,^r V,-<first heterodimerization sequence)-(second 
heterodimerization sequence) ^VH. 

A "repertoire of antigen-binding units" refers to a plurality of antigen-binding 
units, at least two of which exhibit distinct binding specificities. A genetically diverse 
repertoire of antigen-binding units refers to a plurality of antigen-binding units, the 
majority and if not all of the antigen-binding units exhibit unique binding specificities 
with respect to each other. Genetically diverse repertoire typically has a complexity of 
at least 10^ to 10'^ preferably between lO^o 10^ more preferably between 10«to 10'«, 
even more preferably between 10« to lO" distinct antigen-binding units. 

An antibody or Abu "specifically binds to" or "immunoreactive with" an antigen 
if it binds with greater affinity or avidity than it binds to other reference antigens 
including polypeptides or other substances. 

An Abu is displayed "on the surface of a host cell" when the Abu is presented at 
the outer surface of a host cell. The displayed Abu may be directly attached to the outer 
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surface of *e host cell, or may be indiredy attached to fl,e host cell via a hos, cell 
bound genetic package such as phage particle. 

"Surface presenting sequences" refers to sequences that facilitate display of 
hetet^logous sequences. Topically, the surface presenting sequences are present on «,e 
outer surface of a genetic package, e.g. phage or bacteria. Preferred surface presenting 
sequences of phage is pID of M13 filamentous phage. 

"Antigen" as used herein means a substance that is recognized and bound 
specifically by an antibody Antigens can include peptides, proteins, glycoproteins, 
polysaccharides and lipids; portions thereof and combinations thereof. 

As used herein, the lenn "surface antigens" refers to the plasma membt^ne 
components of a cell. It encompasses integral and peripheral membrane proteins, 
glycoproteins, polys^charides and lipids that constitute the plasma membrane. An 
"integtal membrane protein" is a transmembrane protein that extends across the lipid 
btlayer of the plasma membrane of a cell. A typical integral membtane protein consists 
of at least one "membrane spamring segment" that generally comprises hydrophobic 
amino acid residues. Peripheml membrane proteins do not extend into the hydrophobic 
.nterior of the lipid bilayer and fl,ey are bound to the membrane surfa^ by noncovalen. 
interaction with other membrane proteins. 

The tenns "membrane", "cytosolic", "nuclear" and "secreted" as applied to 
cellular proteins specify the exttacellular and/or subcellular location in which the 
cellular protein is mostly, predominantly, or preferentially localized. 

"Cell surface recepto,." represent a subset of membtane proteins, capable of 
binding to their respective ligands. Cell s^face receptors are molecules anchored on or 
mserted into flte cell plasma membr^e. They consUtute a large family of proteins 
glycoproteins, polysaccharides and lipids, which serve not only as stntctutal constiments 
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of the plasma membrane, but also as regulatory elements governing a variety of 
biological functions. 

A "heterodimeric receptor" encompasses cellular proteins composed of two 
proteinaceons submits which exhibits binding affinity to a ligand. The two 
proteinaceous subunits are distinct molecules which differ in amino acid sequence by at 
least one amino acid residue. Non-limiting illustrative heterodimeric receptors are 
mose that bind to growth factors (e.g. hereguhn), neu,otra„™itters (e.g. y-Aminobutyric 
acd), and other organic or inorganic small molecules (e.g. mineralocorticoid, 
glucocorticoid). Preferred he,erod,meric receptors are nuclear hormone receptors 
(Belshaw e, a,. (1996) Proc. Na,i Acad. SC. U. S. A 93(10):4604-4607). erbB3 and 
erbB2 receptor complex, and G-protein-coupled receptors includtng but not limited .0 
opioid (Gomes «<,/. (2m) J. Neuroscience 20(22): RCUO); Jordan « al (1999) 
Nature 399:697-700), muscarinic, dopamine, serotonin, adenosine/dopamine, and 
GABAb families of receptors. 

■•Domain" refers to a portion of a protein that is physically or functionally 
distinguished fom other portions of the protein or peptide. Physically-defined domains 
mclude those amino acid sequences that a^ exceptionally hydrophobic or hydrophihe, 
such as those sequences tira, are membrane-associated or cytoplasm-associated 
Domains may also be defined by internal homologies that arise, for example, from gene 
dupheation. Functionally-defined domains have a distinct biological function(s). The 
hgand-binding domain of a receptor, for example, is fltat domain fl,at binds ligand. An 
antigen-binding domain refers to the pa„ of an antigen-binding unit or an antibody that 
btnds to the antigen. Functionally-defined domains need no, be encoded by contiguous 
ammo acid sequences. Functionally-defined domains may contain one or more 

Physically-deflned domain. Receptots, for example, are generally divided into the 
extracellular ligand-binding domain, a transmembrane domain, and an intracellular 
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effector domain. A "membrane anchorage domam" refers to the portion of a protein 
that mediates membrane association. Generally, the membrane anchorage domain is 
composed of hydrophobic amino acid residues. Alternatively, the membrane anchorage 
domain may contain modified ammo acids, e.g. amino acids that are attached to a fatty 
acid chain, which in tmn anchors the protein to a membrane. 

A "host cell" includes an individual cell or cell culture which can be or has been 
a recipient for the subject vectors. Host cells include progeny of a single host cell. Th. 
progeny may not necessarily be completely identical (in morphology or in genomic of 
total DNA complement) to the original parent cell due to natural, accidental, or 
deliberate mutation. A host cell includes cells transfected In vivo with a vector of this 
invention. 

A "cell line" or "cell culmre" denotes bacterial, plan,, insect or Hgher eukaryotic 
cells grown or maintained /„ vUro. The descendants of a cell may not be completely 
identical (either mon,hologically, genotypically, or phencypically) to the parent cell. 

A "defined medium" refers to a medium comprising nutritional and hormonal 
requirements necessary for the survival and/or growth of the cells in culture such that 
ate components of the medium are known. Traditionally, ,hc defined medium has been 
fomrulated by the addition of nuMtional and growth factors necessary for growd, and/or 
survival. Typically, the defmed medium prx-vides at least one componem from one or 
more of Are following categories: a) all essential amino acids, and usually the basic set 
of twenty amino acids plus cysteine; b) an energy source, usually in the form of a 
carbohydrate such as glucose; c) vitamins and/or other organic compomtds required at 
low concentrations; d) fi« fatty acids; and e) trace elements, whe,. trace elements are 
defined as inorganic compounds or naturally occurring elements that are typically 
required at very low concentrations, usually in the micromolar range. The defmed 
medium may also optionally be supplemented with one or more components from any 
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Of me fo,>„„i„g categories: a, one or ^ore ™i.ose„ic age„.; b, sa,. and buffe. as for 
example, caici™. ma^esiu™, a„<, ph„spba,e; c) nucleosides and bases such as for ' 
example, adenosine ar,d ti,,midine, hypox^ttune; and d) protein and .issue 

hydrolysates. 

As used herein. «,e .enn "isolated" means separated from consdtuents. cellular 
and othen^ise. in which the pol^ucleotide, peptide, pol^eptide, protein, antibody or 
fragments dtereof. are nonnally associated wid, in nature. As is apparent to those of 
*ll m the art, a „„„.„at^„ „ccu™g the polynucleotide, peptide, polypeptide 
protein, antibody, or tragments ther^f, does no. re,ui,e "isolation" to distinguish ,. 
m>m i.s nan-rally occurring coun.erpan. U add..i„„, a "eoncen,ra.ed", "separated" or 
"Cluted" polynucleotide, peptide, polyp^tM, p,,ci„, antibody, or ira^ents Utereof 
.s distinguishable from its naturally occutring countetpart in that the concentration or ' 
number of molecules per volume is ^ater than "concentrated" or less dtan "sepatated" 
than diat of its naturally occurring counterpart. 

Enrichment can be measured on an ab»lute basis, such as weight per volume of 

».ut,on,„ri. can bemeasuredinrelauontoasecond,potentiallyin.erfering substance 
Ptesent m the soutce mix^. hrct^i^, ^^^^^ ^^^^ ^^^^ ^^^^^ 

aremcreasinglymoreprefer,ed.™us,forexan^le,a2-foldemichmentisprefen«i 
10-fold enrichment ismoreprefetred, 100-fold enrichment is mo. p^ferred. 1000-flld 

ennchm.tiseve„mo.prefe„ed.Asubs,anceca„alsob.p,»videdin»isolateds.ate 
by a pmcess of artificial assembly, such as by chemical syndtesis or recombinant 

expression. 

'"-d"lused"or"fusion"a.,.edmte.changeab,yherei„. Thesetenns 
refer to the Joining together of two more chemical elements or components, by whatever 
means including chemical conjugation or tecombinant means. An "in-frame fi.ion" 
refers to the Joining of two or more open reading frames (OFRs) to fo™ a continuous 
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longer OP^ to a .a^er ma. maintains tf,e co„ec. reading ftame of the oHginal OFRs 
Thns, me resulting recombinant fttsion protein is a single p^ein containing two ore 
more sclents that co^espond to polypeptides encoded by the otigina, OFRs (which 
sclents a. not no,™al,y so joined in nature.) Although the reading frame is thns 
made continuous throughout me feed segn,ents, the sclents may be physically or 
spattally separated by, for example, in-frame linker sequence (e.g -flexon'^ as 
described infra. 

m me context of polypepttdes, a "linear sequence" or a •■sequence" is an otder 
ofam.no acids in a polypeptide in an amino to carboxy. tenninus direction in which 
.^sidues mat neighbor each other in mc sequ„ce are contiguous in me primaty stracture 
Of me polyp^tide. A "partial sequence" is a linear sequence of part of a polypeptide 
whtch ,s known to compnse additional residue i„ „„e or bom directions. 

"Heterologous" means derived from a genotypically disUnct entity from me rest 
of me entity to which it is being compared. For example, a promoter removed fi»m its 
nattve coding sequence and operatively linked to a coding sequence ofter than me 
nattve sequence is a heterologous promoter. The tenn "heterologous'' as applied to a 
polj^ucleotide. a polypeptide, means mat me polynucleotide or polypeptide is derived 
ftom a genotypically distinct entity from that of me res. of me entity to which it is being 
compared. For ins,a„ce. a heterologous po,ynucl«>«de or antigen may be derived fom 
a dtfreren. species origin, differen. eel, type, and the s^e type of cell of distinct 
individuals. 

He temts >,ynucleotides», "nucleic acids", -^eoddes" and "oligonucleotides" 
are used interchangeably. They tefer to a polymeric fonn of nucleotides of any lengm 

etmer deoxyribonucleoUdesor ribonucleotides, or analogs mereofPolynucleotidesmay 
have anym,ee^e„,„„^ structure, and mayperfo™ any funcUon, known or tu^own 

lT.= fo,lowinga,enon-hmi«ngexamp,cs„fpol^uc,eotides:codingornon.c«.i„g„gio„s 
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of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, 
messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant 
polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any 
sequence, isolated RNA of any sequence, nucleic acid probes, and primers. A 
polynucleotide may comprise modified nucleotides, such as methylated nucleotides and 
nucleotide analogs. If present, modifications to the nucleotide structure may be imparted 
before or after assembly of the polymer. The sequence of nucleotides may be inteirupted 
by non-nucleotide components. A polynucleotide may be fiirther modified after 
polymerization, such as by conjugation with a labeling component. 

"Recombinant" as applied to a polynucleotide means that the polynucleotide is 
the product of various combinations of cloning, restriction and/or hgation steps, and 
other procedures that result in a construct that is distinct from a polynucleotide found in 



nature. 



The terms "gene" or "gene fragment" are used interchangeably herein. They 
refer to a polynucleotide containing at least one open reading frame that is capable of 
encoding a particular protein after being transcribed and translated. A gene or gene 
fragment may be genomic or cDNA, as long as the polynucleotide contains at least one 
open reading frame, which may cover the entire coding region or a segment thereof 

"Operably linked" or "operatively linked" refers to a juxtaposition wherein the 
components so described are in a relationship permitting them to fimction in their 
intended marmer. For instance, a promoter sequence is operably linked to a coding 
sequence if the promoter sequence promotes transcription of the coding sequence. 

A "fiision gene" is a gene composed of at least two heterologous polynucleotides 
that are linked together. 
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A gene "database" denotes a set of stored data which represent a collection of 
sequences including nucleotide and peptide sequences, which in turn represent a 
collection of biological reference materials. 

As used herein, "expression" refers to the process by which a polynucleotide is 
transcribed into mRNA and/or the process by which the transcribed mRNA (also 
referred to as "transcript") is subsequently being translated mto peptides, polypeptides, 
or proteins. The transcripts and the encoded polypeptides are collectively referred to as 
gene product. If the polynucleotide is derived from genomic DNA, expression may 
include splicing of the mRNA in a eukaryotic cell. 

A "subject" as used herein refers to a biological entity containing expressed 
genetrc materials. The biological entrty rs preferably plant, animal, or microorganisms 
including bacteria, viruses, fungi, arrd protozoa. Tissues, cells and their progeny of a 
I ^ biological entity obtained in vivo or cultured /„ vitro are also encompassed. 

A "vector" is a nucleic acid molecule, preferably self-rephcating, which transfers 
an inserted nucleic acid molecule into and/or between host cells. The term includes 
vectors that function primarily for insertion of DNA or RNA into a cell, replication of 
vectors that function primarily for the replication of DNA or RNA, and expression 
vectors that function for transcription and/or translation of the DNA or RNA. Also 
included are vectors that provide more than one of the above functions. 

An "expression vector" is a polynucleotide which, when introduced into an 
appropriate host cell, can be transcribed and translated into a polypeptide(s). An 
"expression system" usually connotes a suitable host cell comprised of an expression 
vector that can function to yield a desired expression product. 

A "replicon" refers to a polynucleotide comprising an origin of rephcation 
(generally referred to as an on sequence) which allows for rephcation of the 
polynucleotide in an appropriate host cell. Examples of replicons include episomes 
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(such as plasmids), as well as chromosomes (such as the nuclear 
chromosomes). 



or mitochondrial 



Chimeric Heteromiilt.m>>r nf the Present Tnv.nH^„ 

As noted above, proper assembly of polypeptide subunits to fonn a stable 
complex is r^ired to ensure the biological function of a multimeric protein. 
Accordingly, aWl aspect of the present invention is the design of a technique that 
enables specific asW of selected monomeric polypeptides to effect efficient 
production of heterom^mers. The experimental design is particularly useful for 
generating and screening fWeromultimers such as Abus whose binding specificities 
depend on the assembly of spe\fic subumts in a specific mam.er. Distinguished from 
the previously reported chimeric V the subject Abus have one or more of the 
following unique features. First, the are reconstititued via pairwise affinity of two 
heterodimerization sequences, at least oneWhich and preferably both of which, 
lack(s) detectable propensity to fonn homodin^. Unlike the previously reported 
dimerization sequences such as Fos and Jun leuci^ippers that are known to form 
homodimers under both physiological buffer conditio^d physiological body 
temperature (O'Shea et al. (1992) Cell 68: 699-708; Vidai^/. (1996) Proc. Natl 
Acad. Set U.S.A), the subject heterodimerization sequences ar^Vapable of fonning 
homodimers either under the specified buffer conditions and/or .X^^^,,, body 
temperatures. The subject heterodimerization sequences may be fiulh^^tinguished 
from the previously employed sequences at the stmctural level as detailed i^w. 

?!^^l^fmbod^^ present invention provides a chimeric heteromultimer 
'"P''^^' °" *-^^^^^^^?^Kel^^ ^^^p^^^^^ ^.^ ^ ^^^^ 

polypeptide fiised to a first heterodimerization s^^uJ^i^^Z:^:;^^^^^^ 
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sequence; (ii) a second polypepti^ fused to a second heterodimerization sequence; 
wherein the first and second polyJeptides dimerize via pairwise affinity of the first and 
second heterodimerization sequen|tes; wherein at least one of the heterodimerization 

a homodimer under physiological buffer conditions 
ratures. 



sequences is incapable of forming 
and/or at physiological body temp^: 

In another embodiment, the present invention provides a Nsc Abu that 
comprises: (a) a light-chain polypeptide comprising a light-chain variable region linked 
to a first heterodimerization sequence; (b) a heavy-chain polypeptide comprising a 
heavy-chain variable region linked to a second heterodimerization sequence; wherein 
the light-chain and the heavy-chain polypeptides dimerize via pairwise affinity of the 
first and second heterodimerization sequences, at least one of which is essentially 
incapable of forming a homodimer under physiological buffer conditions and/or at 
physiological body temperatures. In another aspect, the present invention provides a 
Nsc Abu whose the Hght-chain and the heavy-chain polypeptides dimerize via pairwise 
affinity of the first and second heterodimerization sequences that are derived from 
heterodimeric receptors. In one aspect, the first and second heterodimerization 
sequences comprising heterodimerization receptor sequences that mediate 
heterodimerization of the receptors. 

In a separate embodiment, the present invention provides a Sc Abu which 
comprises a light-chain variable region and a heavy-chain variable region comiected by 
a first and a second heterodimerization sequence spanning the distance between the C- 
terminus of one of the region to the N-terminus of the other region, wherein the two 
regions form an inh-a-molecular dimer via pairwise affinity of the first and second 
heterodimerization sequences, at least one of which is essentially incapable of forming a 
homodimer under physiological buffer conditions and/or at physiological body 
temperatures. In another aspect of this embodiment, the present invention provides a Sc 
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Abu wherein the Hght-chain variable region and the heavy-chain variable region form an 
intra-molecular dimer via pairwise affinity of two heterodimerization sequences that are 
derived from heterodimeric receptors. In one aspect, the first and second 
heterodimerization sequences comprising heterodimerization receptor sequences that 
mediate heterodimerization of the receptors. 

Selection ofHeterodimerization Sequences: 

Several factors apply to the design of Abus having one or more of the above- 
mentioned characteristics. First, the heterodimerization sequences must exhibit pairwise 
affinity to effect formation of a stable complex. By "stable" is meant that the complex 
or dimer is sufiBciently long-lasting to persist between the formation of the complex or 
dimer, and its subsequent detection and/or purification. The complex or dimer must be 
able to withstand whatever conditions exist or are introduced between the moment of 
formation and the moment of detection, these conditions being a fimction of the assay or 
reaction which is being perfbimed. Preferably, the fomation of the complex or dimer is 
carried out under physiological buffer conditions and at physiological body temperatures 
ranging from approximately room temperature to approximately 37°C. Intervening 
conditions which may optionally be present and which may dislodge a complex or dimer 
include washing, heating, adding additional solutes or solvents to the reaction mixture 
(such as denaturants), and competing with additional reacting species. Stable complex or 
dimer may be irreversible or reversible, but must meet the other requirements of this 
definition. Thus, a transient complex or dimer may form in a reaction mixture, but it does 
not constitute a stable complex if it dissociates spontaneously under physiological buffer 
conditions or as a result of a newly imposed condition or manipulation infroduced before 
detection. 
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' Second, fte selected heterodtaerizadon sequences ™s. exhibit pai^se affinity 

resulfng in pW„», ^^^^^^^^ ,^ ^ 

ho.odi.ers.P,Wab,y.thep,edon,in.tf„t,na.i„nyie,d.ahete™n,tin,eticpo„,tHat 
contains at ieast 6oWr„di„ers, .no.e p.fe„b,y at least 80«/. Wodi„ers. „ote 
preferably between 8 W beterodimers. and „ore pt^ferably between 90-95% 
heterod^ets, and even Lre pt^ferably between 96-99o/. beter„din,e,. tba. are allowed 
.0 fon„ under pbysiologica\bu«fer conditions and/or physiological body ,en,pe,atures 
certain entbodintents of tbVes ent invention, at least one of the beterodin,eriza,io„ 
sequences employed to reconsttW an Abu is essentially incapable of fonning a 

hon,od,n,eri„apbysiologica.bufLand/oratphysiologicalbodyten,peratu,e By 
"essentially incapable" ,s nre.. tba,\e selected hete™.i„erization sequences when 
-ted alone do no, yield detectable an.oVs of hon,odin.e. in an . sedimentation 
expeHnten. as detailed in Kanunerer e, ./.W, 38: 13263-13269, or in 

fte ,„ v,Vo two-hybrid yeas, analysis (see e.gW.e « ,,,,3, 3,,. 

682). in addition, individual heterodintetizatioVequences can be expressed in a host 
cell and the absence of bomodimers in the host ee\can be demonstrated by a variety of 
proten, analyses deluding but not limited to SDS-P^^E. Western blot and 
tmnrunoprecipitation. The ,„ v,.. assays must be conicted under a physiolo^cal 

buffer conditions, and/orpreferablyatphysiologicalbodXtemperatures. Generally a 
Physiological buffer contains a physiological co„centration\fsa,. »d at adjusted ,1 a 
neutral pH ranging flom about 6.5 to about 7.8, and pteferabLom about 7.0 .0 about 
7.5. Avatietyofphysiological buffer, is listed in Samb^ok A. (,989)»^..and 
hence is not detailed herein. Prefetred physiological conditions ^described in 
Kammerer et al , supra. \ 

The specific association of heterodimerization sequences typically involves 
noncovalen. interactions. Such mteractions encompass eve^ exiting stable linkage that 
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does not result in the formation of a covalent bond. Non-limiting examples of 
noncovalent interactions include electrostatic bonds, hydrogen bonding, Van der Waal's 
forces, steric interdigitation of amphiphilic peptides. 

A further consideration in designing the subject Abu is to minimize any 
structural interference between the heterodimerization sequences and the antigen- 
binding site of the resulting heteromultimer. A variety of techniques is available in the 
art for designing a chimeric heteromultimer with minimal internal structural 
interference. For instance, one approach involves the use of minimal heterodimerization 
sequences containing only amino acid residues that are required for heterodimerization. 
The second approach is to link the heterodimerization sequences to either N-tenninus or 
C-terminus of the resulting heteromultimer. The choice of either terminus will depend 
on the location of the biological active domain of the heteromultimer. For constructing 
a chimeric Abu whose antigen-binding site resides in the N-terminal half of the hght and 
heavy chain variable regions, it is preferable to link the heterodimerization sequences to 
the C-terminus of a light or heavy chain. Another alternative design employs a "flexon" 
incorporated between the antigen-binding site and the heterodimerization sequence of 
the heteromultimer. "Flexon" as used herein, refers to a flexible polypeptide linker (or a 
nucleic acid sequence encoding such a polypeptide) which typically comprises amino 
acids having small side chains (e.g. glycine, alanine, valine, leucine, isoleucine, and 
serine). Incorporating flexons between one or more sites of the subject Abu is believed 
to promote functionality by allowing them to assume a conformations relatively 
independent of each other. Such a construction generally provides additional flexibihty 
to the antigen-binding domain. Suitable flexons preferably comprise between about 
four and about one hundred amino acids, more preferably about four to fifty amino 
acids, and even more preferably about four to fifteen amino acids. 
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14 



"^•-«--«onse,ue„cesapp,icab,eforco„s,n.c,i„gU,esubJec.Ab.c» 
e denved fro™ a vaHe.y of Ge„e.„y, an, p,o.ei„ se,„e„ee. involved in .he 

fo™a,io„„f,.,,e,e,e.o„u,«„e,sa„ea„dida.ehe,e™di„eH.a.^^ 
such, .he. sequence. „ay he derived fr„„ he.e.on,u„™erio p..ei„ eo„p,e.e. 
Repre.e„.a.ve ca„dida.e 3e,„ences a. vi^ p^eins s„ch as .he capsid p.o.eh,s of ' 
adeno.associa.ed vin,ses, pr„.ei„ ph„spho^,a.i„n si.s *a. i„.erae. wi.h SH2- 
don,a,„.„n.ai„in„„„i^ (Can.e,y (,993) CeUU: 767-778. Can.e,y«./ 
0995).. B^o, ae.. .70(44,: 26029-26032), do.ah,sof .anscripdon faCo. and 
he.eK,dimeric recep.o.., which mediaie he.eromer fonnation. 

P^fe^d he.ero„n.d„eric .a„scrip,on facors are a-Pa,/ Max complexes and 
Hox^hx complexes. Hox .present a large fa„,i„ of na„scrip.i„n facors involved in 
pa..emi„g .he an,eri„.p„s.erior axis dnHng emhryogenesis. Hox p„.ei„s htad DNA 
w.m a conserved toee alpha hCix ho„eodon,ai„. h, order .o hind .o specific DNA 
sequences, Hox pro.ein, require tt,e presence of he.ero-parme. such as d,e Phx 
|>on.eodon,ain. Wolhcrger«„, so.ved*e2.35Acrys.aism,o.ureofa„oxB,-Phx,- 

DNA.e™a„eon,p,exi„„^er.ou„ders.a„dhow„„x.Phx complex fo™a.ionoccurs 
and how .h.s compiex binds .o DNA. The s^c«.e shows fta. ,he hon,eodon,ain of 
each p„.ein btads .o adjacen. recognition sequences on opp„3i.e sides of .he DNA 
He.erodimeHza.ion occurs .hrough con.ac. fo™,ed beUveen a six amino acid 
exapepHde N-.e™i„a, . «,e homeodomain of HoxB. and a poc.e. in Pbx , f„™ed 

o™sa,.,hahe,ix.ha.pac.sas.„3.he,ix,.ofo™a,argerfourhe,ixhomeodom.n 
(Woiberger.,., (,999) C.«,6: 587-597; Wolberger . 

A vas. number of he.erodimeric .ecep,„. have aiso been ide„.ified. They 
-ude bu. are no. hmiied .o *ose .ba. bind .„ ^w«, f„o. (,g. ,,,^„„, 
neuro.a„smi„ers (e., .-Aminobuiyric acid), and o«her organic or inorganic sma,, 
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heterodimerfonnation. 

5^^-'"^- ~ »o„ novel Kete^dtaeHc .ecep,„. can be employed 

™ode,,nsa„asea.«n,.ec^\..^^,^,„.^,^^^,^^^^^^ 

~"^°"~^°"^:o.eon™„„aon,a.nsappe„e...uZ: 
u-e>atedgenes.No„-W,ingexan,pteXro«ams,ha. „ T 

Blast (liHp.//www.nebi.nta.nih.gov/BLASTA\„. ■ 

. '^'^^'^ ("Genetics Computing Group 

co.e.pon<.gtoa.a.ge.,eeep.o.o.a.^en..,e.eofcanX.^ 

-..Co„»„„,y»p.,e..ta.a.es,„e...ta.not„n,SL 
BMBL, DDB, POB, SWISS-PROT. EST. STS, OSS. and HTOS \ 

^""'-^^'--fhe.erodin.eH.ation sequences eonsist. Of an^phiphme 
Pep..e..a.a.optacoi,e..eoU.e«eaU..e.™e.™e«ea,eo.,eac„.,^ 

■189). Well-charactenzedcoiled-eoil-eontaining proteins 
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inc,„ae„e™.e.„f.Hec.,„.M«a,fa™i,,(..,.,.„,„, ^^^^^^^^^^^ 
™.or family (e.g. ^y^^^, ^^.^^^^ ^. ^^^^^ ^^^^^^^^ 

(e.g. GABA, .ceptors . and 2). Co.,ed-coi, he,erodi„enza.i„„ sequences of fte 

Pres.Unve„,i„nca„,eb„aa,yolassifiedin.„^„^„ps,na™ely«,e,e«.handeda„d 
nght-handed coiled coiK T.e .eft-handed coiled co.,s are ci,a.ac.eri.cd by a hepiad 

jea.d.o.ed..aW...^..eocc»enceofapo,a..esidnesp^ 

tte (a, and fou«h (d) posi.i„„. Tie .sidues a. d,ese ,wo posi.i„ns .ypic^.y 

3-2° -'ih^fi«ns h«,c CO.. ,„ eon,^, ^ 

(0 posi^o. ,ha, cove, «,e periphery of *e coiled coi, are preferably charged 
-dues. Examples of charged a„,i„„ acids include basic residues such as lysine 
ar^nine, h.s«d.ne, and acid, residues such as aspar.a,e, gl„.a„,a.c. asparagine. and 
S...anune. Uncharged or apolar anrino acids sui«e for desiring a he.e„dinrcric 

co..edcoi,includeb„.a„„o„in,i.ed.o,ych,e,alan,ne, valine, leucine, isoleucine 
senne and threonine. While rhe uncharged residues typically fom. a,e hydrophobic' 
core, inter-hehcal and i„.„.helical sah-bridge inCudrng charged residues even a. cor. 
posmons may be employed .o stabilize .he overall hehcal coiled-coilcd structure 

(Burkhardaai (2000),/ S,o/.Cte»,. 275:11672-116771 Wk 

^ ., "'"'""'")■ Whereas varying lengths of 

ocled CO., may be employed, the subject hete„din,erizatio„ se,ue„ces prefe^.y 
contain two to ten heptad repeats. More preferably, dte heterodhnerization scuences 
con.amthreetoeightheptadrepeats.evenmo.prcfcrablycontai„fourtofiveheptad 



repeats. 



to<iesigningoptimalcoiled.„ilhetcK,dimerizationse<,„ences,avarietyof 
cx.s«ng computer software programs that predict the secondary stn^cture of a peptide 
can be used. An illustrative computer analysis uses the COILS algorithm which 
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compares an amino acid sequence with sequences in the database of known two- 
stranded coiled coils, and predicts the high probabihty coiled-coil stretches (Kammerer 
et al{l999) Biochemistry 38:13263-13269). 

Whil^diverse variety of coiled coils involved in hctero-„,igo„eriza«on can be 
employed in ,hcW, invention, prefced coiled coils a,e derived ftom hCerodimeric 
receptors. AccordingiW present invention encompasses the coiled-coil dimeric 
sequences derived fron, exceptors 1 and 2. b one aspect, the subject coiled 
cotls comprise the C-tenninal se,^„f OABA, receptor 1 and GABA, receptor 2 
In another aspect, the subject coiled co^Jw„„p„^ ^„ ^^^^^^^ 
at least 30 amino acid resrdues, one of which Xntially identical to a linear sequence 
of comparable length depicted in SEQ ID NO. 2, and is essentially identical to 

a hnear peptide sequence of comparable length depicted in SE^ID NO. 4. 

A linear sequence of peptide is "essentially identical" to another linear sequence if 

both sequences exhtbit substantial aminoacidseqnencehomology.Genetally.ess^tially 
Identical sequences ate a, least about 60«/. identical with each other, after alignment of tite 
homologous regions. Preferably, the sequences are a. least about 70o/. identical- more 
preferably, they are at lea« about 80% identic^; more preferably, ti,ey are a. least about 
90% identical; more pt^ferably, the sequ^c^ ^ at least about 95% identical; stiU more 
preferably the sequences are 100% identical. 

todetenniningwheter polypeptide sequences ate essentiallyidentical.asequence 
ti.a.p,.se,ves the toctionalityofflte polypeptide wim which i, is being compated is 

particu,ar,ypref^.F„„ctionalitymay be established by diff^entcriteria, such asability 
to fomt a heterodimer with a pairing coiled-coil sequence, and inability to fonn a 

homodin^erunderphysiological buffer conditions and/orphysiologicalbodytempetatitres. 

The invention includes modified GABAe heterodimerization sequences which 
are tunctionally equivalent to the sequences exemplified herein. Modified polypeptides 
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indude those with _ive s.m,.,ons of a„,i„o acd res.dues and 
one or mo. demons or additions of amino acids which do no, significantly 

de.e.eno„s,y aiterthehoterodimerization specificity. Snhstitutions can range fiom 
changmg or modifying one or more amino acid residues to complete redesign of a 

-Stona^longasthepainviseaffinityismaintained. Amino acid snbstihttions if 
present, are preferably conservative substitutions that do no. deleteriousiy aff.t folding 
or functional properties of the peptide. Groups of tunctionally related amino acids 
wthtn which consen-ative substitutions can be made are glycine/alanine- 
vahne/isoleucine/leucine; a.paragine/g,„tamine; aspartic aeid/glutamic acid- 
senne/threonine/methionine; lysine/argimne; and phenylalanine/tryosine/tryptophan 
Polypeptides of this invention can be in glycosylated or unglycosylated form, can 
modtfied post-translationally (e.g.. acetylation. and phosphorylation) or can be modified 
synthetically (e.g., (he attachment of a labeling gtoup). 

Co„/,gura,io«s ant Modifica,U,ns ofAnHgen-Biniing Uni,s (Aius): 

IheAbu.„fthepresen,inventioncanad„p,avarietyofconfig..a,ions. The 
smallest non-single chain Abu is a monovalent ccFv fra^ent. Tie ccFv flagment is a 
d-nteno protein composed of VL and VH regions, which dimetize via the pairwise 
afflntty of the first and second hetetodimerization sequences toed in-fiame with the VL 
»d VH regions, respectively. Preferably, the ccFv contains a short flexon sequence that 
provides additional flexibility to the VL and VH regions (see an exemplary ccFv in 
Ftgure 1). AnrorecomplexNscAbuisamultivalentmoleculecapableofbindingto 
more than one antigen of the same kind (i.e. multivalent but monospecific) or diiferem 
lend (,.e. multivalent and multispecific Abus). Typically, a multivalent Abu is a 
heteromultimer composed of more than one L and „ chain polypepHdes, in which either 
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«.eLor„p„„p^«,e„,,„ftco„t™™.«,a„„„eVregi„„. F„H.,a„ee a„ 
ITte H ch.„ p„,yp.p,„e ^ i„„3„^,, ^^^^ ^^^^^ 

eac.0fw«cM..e„.e....aV..e.„„.„.„3ti..,e.„»..,^^^^^^^ 
A.te.at,ve,,«,eLchai„p„,„„,,^„,,^^„^^_^^^__^^^^^___^^^^^-^ 

™»...t.aVH„,„„,„„co„..„,,,,„,„,,„^^_,^^ 

'""77"'""'^^*"---''--f«--f.etwo..e™.^^^^^^^^ 
ue„c.,.„.e..„..v.a„,V„„3.o..T.e.B„.a„eff.^^^^^ 

no„a„c«„„a.VHM,o.VZ^.._.pp,,„3,3.e„e„,a„H,«,,.^^^^^^^ 
^che^e. tnvalen. a„a tetravalen. Ab. can be eonstructed (see, e., Pi^ 

A vanan. app„ach to co,.m,c, ™,.,va.e„, Abus employs a scFv or dsFv 

onea„.„„ssUe,Ab„so™sco„f.su.tio„co„tai„.^ 
^^e„t3,.„.,^,,,,^^^^^^^,^^^^^^^^^^^^^^^_^^^^^_^^^ 

conf.^tio„(.^,3,,,_„„^„,,^^^^^^_^^^^^^^_^_^ 

pa™sea«„,of.beHet.o.„e.at.„„.,„e„e.,.ea.o.beV.„.o„a„.^^ 

^.o„(a3<„ecPv),tbeo«.eH.p„vMedb..bescPv„r..K.a^e„.,,.,,3,,. 
^-.t.tbev..,„„.,,_,,,,,^^^^„^_^^^^^^^^_^^^^ 

Vfi region. 

The san^e approach car, be employed to generate triva,e„. ccPvscPv or ccFv- 
dsPvAb.a..bo™i„Pi^,,.^„„^^^,_^^^^^^_^_^^^^^^^___^^^^ 

configuration of coPv,.P.,. ^„ „^^_^^ be.e.din,eri.atio„ 

.e,.ence.cFv..and..VL.econdbe,e,.di„eH..i„„se,„c„ce^P...,,„,,,„^^,^^ 
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P^^ise affinity „f u,e he.erodi™eri.«„„ sequences .„ .„„3«„,e tee binding 
s..es. One of the binding sites is eo™pos«i of fte VL and VH regtons of the ccFv 
bu.ldi„g units; the remaining ,„„ provided by the scFv ,ragn,ents linked to the 
respective VL a. VH poIypeptHes. Altentatively. the multivalent Abus can be 
conflgu^d as ccFv-scFv-dsFv. In this configuraUon, one of the antigen-binding sites is 
assembled and stabilized via an inter-molecular disulfide bond betwe« apair of 
cysteine residues that is located within the VHand VLregions of thedsFv fragment a 
ftmher vatiant of this conflation is a trivalen, ccFv-(dsFv),. in which two of the 
b-ndtng sites assume the dsFv fonnat (see. e.g. Figure ,4, Any other multivalent Abus 
vanants emplojdng the basic ccFv building unit, whether being monospeciflc or 
mulhspecifc. are encompassed by this invention. 

Accordingly, this invention fluther pravides multispecific Abus. They a,, 
mulhvalen, molecules capable of binding to at least two distinct antigens. Prefc^ed 
mulfspecific Abus are bispecific and trispecifie molecules exhibiting binding 
speciflcities ,o two and Urree distmct antigens, respectively. Distinguished ftom 
prevtously chatacterized multispecific antibodies (see e.g. U.S. Patent No 5 932 448) 
fte subject multispecific Abus comprises one or more ccFv butlding unit with distinct' 
bmding specificifies. The subject multispecific Abus can also incorporate one or more 
SCFV or dsFv fragments as detailed above. Prefen^ bispeciflc and trispecific Abus are 
configured a«ording to the general structures depicted in Figures 15-17. 

Aside flom non-single chain Abus. the present invention includes single-cham 
Abus that is stabilized by the subject heterodimerization se,ue„ces. Typically, the Sc 
Abus comprises a VL and a VH region forming an intra-molecular dimer via the 
patrwise affinity of fl,e heterodimerization sequences comtected to these two regions 
The heterodimerization se,uences may be confi^,^, i„ either a parallel or anti-p„allel 
manner (see. e.g. Fi^e ,8). In a parallel configuration, the two heterodimerzation 
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sequences a. aligned such ,ha. a,ey have fte same „rie„,a,i„„ ,a.i„„..e™i„a, ,„ 
ca*oxy,..enni„a.). U an a„«.pa.a„e, conHs^tton. ,he he,er„din,eriza«„n sequences 
are arranged such ,ha. .he amino-tenninal end of one se,uence is aligned wift dre 
«rboxyl.,e™i„aI end ofme other sequence, and vice ver^a. Generally fte 
ire.erodin,eri3a«on sequences are linked .ogeta via a flexon sequence. As described 
herein, flexon is a flexihle polypepride «er ,or a nucleic acid sequence encoding such 
a polypeptide) which typically con,p,.,« annno acids having small side chains (e g 
glycne, alanine, valine, leucine, isoleucine. and serine), hrcorporating flexons between 

U.etwoheterodin,eH.adons«,„encesgenera„yprovidesspa.iainexibilityfor,hemto 
form an intra-nrolecular dimer. Suitable flexons for the anti-parallel configuration 
preferably comprise between about four to about one hundred amino acids more 
P^ferably about four to flfiy amino acids, and even more p.ferably aboutfo. to 
fifteen amino acids. Flexons for the parallel configuration are generally longer 
preferably ranging from about ten to about one hundred amino acids, more pref^bly 
ftom about fifty to about thirty amino acid residues. 

Where desired, one or more pair, of cysteine residues may be incorporated at the 
N- or C- temrinus of the heterodimerization sequences to tother stabrlize the Abus of 
the present invention. 

The Abus of this invention may contain sequences derived from the constant 
..S-ons of an L chah, or a H chain. Such sequences derived ftom the constant regions 
are gene^lly placed between a light-chain or a heavy-cham variable region and the 
heterodimerization sequence to which it is hnked. In addition, the fight and heavy 
chatns may contain partly or entirely human sequences. 

Methods for humanizing non-human antibodies are well known in tire art 
"Humanized" antibodies are antibodies in which at least par, of the sequence ha. be. 
altered fi.m its initial fo,™ ,„ render it more like human immunoglobulins, h, one 
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0329400. I„a«ve.,„„,v.,„„..,e,„_,,,^,^,^^^^_ 
sequences of a.d „ou.e V regions, and „s,dues ou.side *e CDRs 

lhat are different between the consensus sequences. 

In making humanized antibodies, the choice of iramework residues can he 

cnt.ca.inre.aini„ghighbindu,,a«in,.,.h,pru,ctp,e,atVan,eworkse,ueucefiomany 

HuAhcanse„eas.hetentpIa.eforCDRsrafti„g;ho„ever,ithasbee„den,o„st,.ted 

haU,3,h.CDRrepiacementi„t„sucha,ameworkcan,eadtosi,i,cant,ossof 
bmding affinity to die antigen. Glaser « ,7/ ( 1 007 w , 

""■""^^■'■■''"'"""°'1'<9:2606; Tempest a 

more homologous a HuAb is ,0 the original muAb, the ,ess >*e,y that the hu^an 
framework wiU inh^duce distonions into the n,u.ine CDRs that couid reduce afHnity 

B.eaonase,u»cehomo,ogysearchagainstanantibodyse,uencedatabase,«re„„Ab 
provides good framework homology to muM4TS.22, although other highly 

homo.ogousHuAbsw„„,dbesui.ab,easwell,espec,anykappaLchains.^on,hn„a„ 
-b^oup, or H Chains S^^htunan subgroup m. Kabat«.,. (,987) Various 
computer pr^^ams such as ENCAD (Levin e,a,. (.983,^ Mol. Bio,. ,68-595, are 
available .0 predict the ideal sequence for the V region, i„ve„,i„n thus 

enco„passes„uAbswithdiffere„tVregions.:tiswi.h,n.heskillofoneintheartto 
detennine suitableVregionsequencesand to optimi^these sequences. Methods for 

o.an,.gan,,„,,3..hr.„cedi™niunogenicitya.also described i„U.S.Pate.^ 

5,270,202 and EP 699,755. 
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«>ea^«,.a„ao.h.fav„.He.o,„,ica,p.„pe„ies.T„acWeve«,.,„,.aec<>M,„3.„ 
a prefe^eC ^eftC. h™a„i.ed a„.*„die. a. p^^ed By a p.oess of analysis of .He 

pa™.a,se,„e„eesa„d various co„cep.„a,hu™a„,.edp„d«s.i„g«„ee<U„^^^^^^^ 
mo e,s Of .He pare„.a, a.d Hu„.a„i.ed sequences. THree dHnensiona, i™,u„„„oH„,i„ 
mode, are fa„,i,ia, ,„ a,ose s«„ed in .He ^. Co„p.e, p.o^„, 

."-a.e and display probable *.ee^„en.i„nal co„fo™a«ona, s.n.c..es of selected 
can .da.e i„™^„„„,„„„ ,,,„^_ ^^^^ ^^^^ ^^^^^^ ^^^^ 

*e Hkely role of fte residues i„ d,e fimcionin, ^^^^ in»™ogl„HuH„ 
sequence, i.e., .He analysis of residues m influence Ure abi,i.y „f .He candidaie 
-nmunoglobuHn .o bind i.s antigen, m d.is way, FR r^,.,, 
combmed d,e consensus and i^por, sequence so ,Ha. Ure desired antibody 
charac.eris.ic. sucH as increased affini.y for .He Urge. an.igcn(s,. is achieved 

The .nvenHon also enconrpasses Abus conjugated .o a chemically tociional 

mccy. Typically..Hen,oie,y isalabel capable of producingadcecblesignai^ese 
conjugated Abus a,, usefl.1. for exa„,ple. in detection systems such as ,uanU,ation of 

Wr burden, and ima^ng of metastatic foci and tumor ima^ng. SucH labels a,. 

^own in tHeartandinclude.bu,a.„o.limi.ed.o, radioisotopes, en^ymes, fluorescent 
compounds, cbemiluminescen. compounds, bioluminescan, compounds subsn..e 

CO actors and H^bitc. See. for examples of paten. .eacHing flre use of such labels 

4.366,241. Themoietiescanbecovalenfly linked to Abus.rec„mbinant,y linked or 
conjugated to Abus through a secondary reagent, such as a second ».ibody, protein A 
or a biotin-avidin complex. 

other functional moieties mclude si^al peptides, agents that eritance 
.mmunologic reactivity, agents flta. facihtate coupHng ,o a solid support, vaccine 
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ca^e.. Worespo^e .„*f.e., p.^,,,,.;, . ^ 

short a„,i„„ acid sequence dtes a newly syn.hes,zed pr„«i„ teough a ceUnlar 
men^b^e. usuaUy ,he e„d„p,asn„c re„c„,„„ i„ euka^oUc ce„s. and ei«,e. d,e inner 
«en,.™e or inner and on.er „e..™es of ,ac.eHa. Signa. pep«de. a„ ,,ica„y 
a. *e ;.-,e™i„a, p„rti„„ „f a p„,ypep«de and are typically removed e„zyn,a,ically 
benveen bios>.,he.is and secretion of d,e polypeptide front the eel,. Snch a peptide can 
be tncorporated into the subject Abus to a„ow secretion of the synthesized molecules 

Agents that enhance inmunologic reactivity include, bn, are not Umited to 
bactenal superanttgens. Agents that facilitate coupling to a solid support include but 
are not hunted to, biotin or avidin. Znununogen capers include, but are no. Innit^ to 
any physiologically acceptable buffers. Bioresponse modifier, include cytokines 
parttcularly tumornecrosis factor (TW). i„ter,euldn-2. i„terleuldn-4, granulocyte 
macrophage colony stimulating factor and y-interferons. 

Suitable drug ntoieties include antineoplastic agents. Non-limiting examples are 
:ad,oiso,opes, vinca alkaloids such as fte vinblastine, vincristine and vindesine sulfates 
adnamycin. bleontycin sulfate, c^boplatin. cisplatin. cyclophosphamide, cytarabine ' 
daearbaztne. dactinontycin, duanorttbich, hydrochloride. doxo™bici„ hydrochloride ' 
etopostde, fluorouracil. lomustine. mecbloro„tha.ine hydrochloride, melphalan ' 
mercaptopurine, nrethotrexate, mitomycin, mitotane. pentostatm, pipobroman ' 
proearbaze hydrochloride, streptozotocin, taxol. thioguanine. and uracil ntust^d 

Immunotoxins. including single chain molecules, can be produced by 
—ant means. P-luction of various itnmunotoxins is well-lcnown m the art and 
medtods can be found, for example, in "Monoclonal Antibody-toxin Conjugates- ' 
Atmtng the Magic Bullet," Thotpe (.982) MonoCona, A.mo,ies i„ CUn,cal 
Medicine, Academic Press, pp. 16.-,90; Vitatta (1987) Science 238:1098-1 104- and 
Wtnter and Milstein ,1991, Na,.re 349:293-299. Suitable toxins include, but arl no. 
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limited to. ricin, radionuclides, pokeweed antiviral protein, Pseudomonas exotoxin A 
dtphtheria toxin, ricin A chain, fungal toxins such as restrictocn and phospholipase 
enzymes. See. generally. "Chimeric Toxins," Olsnes and Pihl. Pharmac. THer. 15:355- 
381 (1981); and "Monoclonal Antibodies for Cancer Detection and Therapy." eds. 
Baldwin and Byers. pp. 159-179, 224-266. Academic Press (1985). 

The chemically functional moieties can be made recombiuantly for instance by 
creating a fusion gene encoding the Abu and the fimctional moiety. Altematively. the 
Abu can be chemically bonded to the moiety by any of a variety of well-established 
chemical procedures. For example, when the moiety is a protein, the linkage may be by 
way of heterobifimctional cross linkers, e.g.. SPDP. carbodiimide glutaraldehyde, or the 
hke. The moieties may be covalently liri^ed, or conjugated, through a secondary 
reagent, such as a second antibody, protein A. or a biotm-avidin complex. Panm^agnetic 
moieties and the conjugation thereof to antibodies are well-known in the ari. S«,. e.g.. 
IMiltenyicrai (1990)Cyto»,arv 11:231-238. 

Preparation of Antigen-Binding Units (Abus): 

The subject Abus can be prepared by teeombinant DNA technology, synthetic 
chemistry techniques, or a combinarion toereot Forinst^ce. sequences encoding the 
desired components of the Abus. including VL, VH and the hetetodhnerization 
sequences are typically assembled and fragments ligated into an expression vector. 
These sequences may be assembled from other vectors encoding the desired protein 
sequence, fiom PCR-generated fragments using respective template nucleic acids, or by 
assembly of synthetic oligonucleotides encoding the desired sequences. However, all 
nucleic acid sequences encoding the Abus are preferably assembled by in-frame fi^ion 
of coding sequences. Flexons. described above, can be included between 
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componems and do„,ai„s in order to enhance ,he ability of fte individual components to 
assume a confiscations relatively independently of each other. To ptoduce Nsc Abus 
fte L and H chain can be fonned separately and then assembled, or assembled ,„ v,v. by 
an expression system for both chains. Such expression systems can be created by 
transfecting a suitable cell with a vector comprising separate transcribable regions for 
the L and H chain, or by co-transfeeting dte same cell with vectot. for each chain. 

The assembled Abus can be isolated using a variety of protein purification 
.echmques known in the art. GenentUy, the Abu is isolated from culmre media as 
s^med polypeptides. alUtough they can be recovered from host cell Iysa.es or bacterial 
periplasm, when directly produced without signal peptides. If the Abus are membrane- 
bound, they be solubilized by suitable detergent solutions commonly employed by 
artisans in the field. The recovered Abus may be further purified by salt precipitafon 
(e.g., w„h ammomum sulfate), ion exchange chromatography (e.g. on a cationic or 
anionic exchange column run at neutral pH and eluted wim step gradients of increasing 
.omc strength), gel filtration chromatography (including gel filtration HPLC), and 
chromatography on tag-affimty column, or on affinity resins such as protein A, protein 
G, hydroxyapatite, and anti-immunoglobulin. 



The invention provides various polynucleotides that encode the Abus of the 
invention. The invention polynucleotides are characterized, in part, by the unique 
heterodimerization sequences contained therein as detailed above. Such 
heterodimerization sequences enable efficient assembly of and screening for Abus, such 
as those that specifically bind to a desired antxgen. Such sequences also facilitate 
display of heteromultimers on living biological entities includmg phages, bacteria, other 
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prokaryotic or eukaryotic cells. Preferred heterodimenzation sequences shown in SEQ 
ID NOS. 2 and 4. 

In one embodiment, this invention provides isolate polynucleotides that encode 
the subject Nsc Abus. In one aspect of this embodiment, the recombinant 
polynucleotide comprises a coding sequence that encodes the light-chain polypeptide of 
a subject Nsc Abu. hi another aspect, the recombinant polynucleotide comprises a 
coding sequence that encodes the heavy-chain polypeptide of a Nsc Abu. In yet another 
aspect, the recombinant polynucleotide comprises two separate coding sequences, one 
of which codes for the Hght-chain polypeptide, and the other codes for the heavy-chain. 

Nucleotide sequences corresponding to various regions of L or H chains of an 
existing antibody can be readily obtained and sequenced using convention techniques 
including but not hmited to hybridization, PGR, and DNA sequencing. Hybridoma cells 
that produce monoclonal antibodies serve as a prefeired source of antibody nucleotide 
sequences. A vast number of hybridoma cells producing an array of monoclonal 
antibodies may be obtained from public or private repositories. The largest depository 
agent is American Type Culture Collection (http://www.atcc.org), which offers a diverse 
collection of well-characterized hybridoma cell lines. Alternatively, antibody 
nucleotides can be obtained from immunized or non-immunized rodents or humans, and 
form organs such as spleen and peripheral blood lymphocytes. Specific techniques 
appHcable for extracting and synthesizing antibody nucleotides are described in Oriandi 
et aliim) Proc. Natl. Acad. Sci. U.S.A 86: 3833-3837; Larrick et al. (1989) Biochem. 
Biophys. Res. Commun. 160:1250-1255; Sastry e^a/. (1989) Proc. Natl. Acad. Sci.. 
U.S.A. 86: 5728-5732; and U.S Patent No. 5,969,108. 

The antibody nucleotide sequences may also be modified, for example, by 
substituting the coding sequence for human heavy and Hght chain constant regions in 
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place of the homologous non-human sequences. M that manner, chimeric antibodies are 
prepared that retain the binding specificity of the original antibody. 

It is also understood that the polynucleotides embodied in the invention include 
those coding for functional equivalents and fragments thereof of the exempHfied 
polypeptides. Functionally equivalent polypeptides include those that enhance, decrease 
or not significantly affect properties of the polypeptides encoded thereby. Functional 
equivalents may be polypeptides having conservative amino acid substitutions, analogs 
including fusions, and mutants. 

Due to the degeneracy of the genetic code, there can be considerable variation in 
nucleotides of the L and H sequences, as well as the heterodimerization sequences 
suitable for construction of the polynucleotide and vectors of the present invention. 
Sequence variants may have modified DNA or amino acid sequences, one or more 
substitutions, deletions, or additions, the net effect of which is to retain the desired 
antigen-binding activity. For instance, various substitutions can be made in the coding 
region that either do not alter the amino acids encoded or result in conservative changes. 
These substitutions are encompassed by the present invention. Conservative amino acid 
substitutions include substitutions within the following groups: glycine, alanine; valine, 
isoleucine. leucine; aspatic acid, glutamic acid; asparagine, glutamine; serine, threonine; 
lysine, arginine; and phenylalanine, tyrosine. While conservative substitutions do 
effectively change one or more amino acid residues contained in the polypeptide to be 
produced, the substitutions are not expected to interfere with the antigen-binding 
activity of the resulting Abus to be produced. Nucleotide substitutions that do not alter 
the amino acid residues encoded are useful for optimizing gene expression in different 
systems. Suitable substitutions are known to those of skill in the art and are made, for 
instance, to reflect preferred codon usage in the expression systems. 
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Where desired, the recombinant polynucleotides may comprise heterologous 
sequences that facilitate detection of the expression and purification of the gene product. 
Examples of such sequences are known in the art and include those encoding reporter 
proteins such as P-galactosidase, P -lactamase, chloramphenicol acetyltransferase 
(CAT), luciferase, green fluorescent protein (GFP) and their derivatives. Other 
heterologous sequences that facilitate purification may code for epitopes such as Myc, 
HA (derived from influenza virus hemagglutinin), His-6, FLAG, or the Fc portion of 
immunoglobulin, glutathione S-transferase (GST), and maltose-binding protein (MBP). 

The polynucleotides can be conjugated to a variety of chemically functional 
moieties described above. Commonly employed moieties include labels capable of 
producing a detectable signal, signal peptides, agents that enhance immunologic 
reactivity, agents that facilitate coupling to a solid support, vaccine carriers, bioresponse 
modifiers, paramagnetic labels and drugs. The moieties can be covalently linked 
polynucleotide recombinantly or by other means known in the art. 

The polynucleotides of the invention can comprise additional sequences, such as 
additional encoding sequences within the same transcription unit, controlling elements 
such as promoters, ribosome binding sites, and polyadenylation sites, additional 
transcription units under control of the same or a different promoter, sequences that 
permit cloning, expression, and transformation of a host cell, and any such construct as 
may be desirable to provide embodiments of this invention. 

The polynucleotides embodied in this invention can be obtained using chemical 
synthesis, recombinant cloning methods, PCR, or any combination thereof Methods of 
chemical polynucleotide synthesis are well known in the art and need not be described 
in detail herein. One of skill in the art can use the sequence data provided herein to 
obtain a desired polynucleotide by employing a DNA synthesizer or ordering from a 
commercial service. 
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Polynucleotides comprising a desired sequence can be inserted into a suitable 
vector which in turn can be introduced into a suitable host cell for replication and 
amplification. Accordingly, the invention encompasses a variety of vectors comprising 
one or more of the polynucleotides of the present invention. Also provided is a 
selectable library of expression vectors comprising at least one vector encoding the 
subject Abus. 

Vectors of the present invention are generally categorized into cloning and 
expression vectors. Cloning vectors are useful for obtaining repHcate copies of the 
polynucleotides they contain, or a means of storing the polynucleotides in a 
depository for future recovery. Expression vectors (and host cells containing these 
expression vectors) can be used to obtain polypeptides produced from the 
polynucleotides they contain. Suitable cloning and expression vectors include any 
known in the art, e.g., those for use in bacterial, mammahan, yeast, insect and phage 
display expression systems. 

Suitable cloning vectors can be constructed according to standard techniques, or 
selected from a large number of cloning vectors available in the art. While the cloning 
vector selected may vary according to the host cell intended to be used, useful cloning 
vectors will generally have the ability to self-rephcate, may possess a single target for a 
particular restriction endonuclease, or may cany marker genes. Suitable examples 
include plasmids and bacterial viruses, e.g., pBR322, pMB9, ColEl, pCRl, RP4, 
PUC18, mpl8, mpl9, phage DNAs (including filamentous and non-filamentous phage 
DNAs), and shuttle vectors such as pSA3 and pAT28. These and other cloning vectors 
are available from commercial vendors such as Clontech, BioRad, Stratagene, and 
Invitrogen. 

Expression vectors containing these nucleic acids are usefiil to obtain host vector 
systems to produce proteins and polypeptides. It is implied that these expression vectors 
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must be replicable in the host organisms either as episomes or as an integral part of the 
chromosomal DNA. Suitable expression vectors include plasmids, viral vectors, 
including phagemids, adenoviruses, adeno-associated viruses, retroviruses, cosmids, etc. 
A number of expression vectors suitable for expression in eukaryotic cells including 
yeast, avian, and mammaUan cells are known in the art. One example of an expression 
vector is pcDNA3 (Invitrogen, San Diego, CA), in which transcription is driven by the 
cytomegalovirus (CMV) early promoter/enhancer. Two types of particularly usefiil 
expression vectors for expressing the subject Abus are the phage display vector and 
bacterial display vector. 

The techniques for constructing phage display vectors are well established in the 
art (see review article by Winter G. et al. (1994) Ann. Rev. Immunol. 12:433-55). Both 
filamentous and non-filamentous phage sequences are applicable for constructing a 
display vector. Filamentous phage vectors are preferred because the genomes of many 
representative phages of this class have been sequenced, and their genomes are found to 
be much smaller than that of non-filamentous phages. Representative phages of this 
class include M13, fl, fd, Ifl, Ike, Xf, Pfl, and PG. The phage vector typically is 
constructed to express heteromultimers, e.g. antibody peptides, by fusion to a part or all 
of a phage coat protein. Suitable coat proteins include pIII, VIII, VI, VII and DC of 
M13. The heteromultimer sequence must be inserted into the phage vector in such a 
way that the integrity of the expressed phage coat is not undermined, and the 
heteromultimer is preferably biologically functional. 

For constructing pIII fusion vector, commonly employed fusion sites are located 
at the amino terminus, in between the flexible spacer between the two domains of pIII 
(Smith et al. Science 288:1315-17), or any other alternative fusion sites described in 
U.S. Patent Nos. 5,969,108, 5,837,500. The pIII fusion and other proteins of the phage 
can be encoded entirely within the same page replicon or on different replicons. When 
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at least two replicons are used, the pin fusion is generally encoded on a phagemid, a 
plasmid containing a phage origin of replication. Phagemids can be packaged into 
phage particles by "rescue" with a helper phage such as M13K07, which provides all 
the phage proteins, including pIII, but due to a defect origin is itself poorly packaged in 
competition with the phagemids. Other multivalent helper phages (e.g. M13 gHI) that 
lack or contain altered pHI to enhance the package efficiency can also be employed 
(Rondot ei al. Nature Biotechnology 19:75-78). 

Similar constructions can be made with other filamentous phage. Pf3 is a well- 
known filamentous phage that infects Pseudomonas aerugenosa cells that harbor an 
hicP-1 plasmid. The entire genome has been sequenced and the genetic signals involved 
in replication and assembly are characterized. The major coat protein of PF3 is unusual 
I in having no signal peptide to direct its secretion. The sequence has charged residues 

ASP7, ARG37, LYS40, and PHE44-COO- which is consistent with the amino terminus 
being exposed. To construct a display Pf3 vector, it is generally desirable to engineer a 
signal sequence known to cause secretion in P. aerugenosa fiised in-frame to a gene 
fragment encoding a heterologous polypeptide, which in turn is fiised in-frame with a 
^ DNA encoding the mature Pfi coat protein. 

The same general construction scheme appHes to generating display vectors 
containing sequences derived from non-filamentous phages including bacteriophage 
X174, X, T4 and T7 phages. A wealth of information on the structures of these non- 
filamentous phages is known in the art. One skilled in the art can readily generate a 
corresponding display vector that expresses the subject heteromultimers using the 
unique heterodimerization sequences without undue experimentation. 

In addition to phage display vector, another class of preferred vector is bacterial 
display vector. The general scheme outlined above is equally applicable for 
constructing such vectors. Briefly, the vectors facihtate expression of a heteromultimer, 
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Abus in particular, as a fusion with a bacterial surface protein. Prior research has 
revealed a vast number of bacterial surface proteins applicable for expressing such 
fusions. Non-limiting examples of bacterial surface proteins are LamB (Bremer et al. 
Proc. Natl. Acad. Set U.S.A. (1984) 81:3830-34; Gene (1987) 52:165-73); OmpA (Prog 
Biophys Molec Biol (1987) 49:89-1 15); OmpC; OmpF (Pages et al. Biochemimie 
(1990) 72:169-76); PhoE (van der Ley et al. J. Biol Chem. 261:12222-5); pilin (So et 
al. Curr Top in Microbiol & Immunol (1985) 118:13-28); pldA (de Geus et al. EMBO J. 
(1984) 3(8): 1799-1802) and their homologs. Characterization of these and other 
surface proteins, and the methods of using these proteins for displaying heterologous 
polypeptides are detailed in U.S. Patent No. 5,837,500 as well as the references cited 
therein. 

The vectors of the present invention generally comprises a transcriptional or 
translational control sequences required for expressing the Abus. Suitable transcription 
or translational control sequences include but are not limited to replication origin, 
promoter, enhancer, repressor binding regions, transcription initiation sites, ribosome 
binding sites, translation initiation sites, and termination sites for transcription and 
translation. 

As used herein, a "promoter" is a DNA region capable under certain conditions 
of binding RNA polymerase and initiating transcription of a coding region located 
downstream (in the 3' direction) from the promoter. It can be constitutive or inducible, 
hi general, the promoter sequence is bounded at its 3' terminus by the transcription 
initiation site and extends upstream (5' direction) to include the minimum number of 
bases or elements necessary to initiate transcription at levels detectable above 
background. Within the promoter sequence is a transcription initiation site, as well as 
protein binding domains responsible for the binding of RNA polymerase. Eukaryotic 
promoters will often, but not always, contain "TATA" boxes and "CAT" boxes. 
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The choice of promoters will largely depend on the host cells in which the vector 
is introduced. For animal cells, a variety of robust promoters, both viral and non-viral 
promoters, are known in the art. Non-Umiting representative viral promoters include 
CMV, the early and late promoters of SV40 virus, promoters of various types of 
adenoviruses (e.g. adenovirus 2) and adeno-associated viruses. It is also possible, and 
often desirable, to utilize promoters normally associated with a desired light or heavy 
chain gene, provided that such control sequences are compatible with the host cell 
system. 

Suitable promoter sequences for other eukaryotic cells include the promoters for 
3-phosphoglycerate kinase, or other glycolytic enzymes, such as enolase, 
glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and 
glucokinase. Other promoters, which have the additional advantage of transcription 
controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen 
metabolism, and the aforementioned glyceraldehyde-3-phosphate dehydrogenase, and 
enzymes responsible for maltose and galactose utilization. 

In certain preferred embodiments, the vectors of the present invention use strong 
enhancer and promoter expression cassettes. Examples of such expression cassettes 
include the human cytomegalovirus immediately early (HCMV-IE) promoter (Boshart 
et al. Cell 41: 521,(1985)), the p-actin promoter (Gunning et al. (1987) Proc. Natl. 
Acad. Set (USA) 84.- 5831), the histone H4 promoter (Guild et a/.(1988), J. Viral. 62. 
3795), the mouse metallothionein promoter (Mclvor et al. (1987), Mol, Cell. Biol. 7: 
838), the rat growth hormone promoter (Millet et al. (1985), Mol. Cell Biol. 5: 43 1), the 
human adenosine deaminase promoter (Hantzapoulos et al. (1989) Proc. Natl. Acad. 
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Sci. USA 86: 3519), the HSV tk promoter 25 (Tabm et al. (1982) Mol. Cell Biol. 2: 
426), the a-1 antitrypsin enhancer (Peng et al. (1988) Proc. Natl. Acad. Sci. USA 85: 
8146), and the immunoglobulin enhancer/promoter (Blankenstein et al. (1988) Nucleic 
Acid Res. 16: 10939), the SV40 early or late promoters, the Adenovirus 2 major late 
promoter, or other viral promoters derived from polyoma viris, bovine papilloma vims, 
or other retroviruses or adenoviruses. The promoter and enhancer elements of 
immunoglobulin (Ig) genes confer marked specificity to B lymphocytes (Banerji et al. 
(1983) Cell 33: 729; Gillies et al. (1983) Cell 33: 717; Mason et al. (1985) Cell 41: 
479), while the elements controlling transcription of the B-globin gene function only in 
erythroid cells (van Assendelft et al. (1989) Cell 56:969). 

Cell-specific or tissue-specific promoters may also be used. A vast diversity of 
tissue specific promoters have been described and employed by artisans in the field. 
Exemplary promoters operative in selective animal cells include hepatocyte-specific 
promoters and cardiac muscle specific promoters. Depending on the choice of the 
recipient cell types, those skilled in the art will know of other suitable cell-specific or 
tissue-specific promoters applicable for the construction of the expression vectors of the 
present invention. 

Using well-known restriction and hgation techniques, appropriate transcriptional 
control sequences can be excised from various DNA sources and integrated in operative 
relationship with the intact selectable fiision genes to be expressed in accordance with 
the present invention. 

In consfructing^ subject vectors, the termination sequences associated with the 
transgene are aIsoh;;;^^JlBRn^^ desired to be transcribed to 

provide polyadenylation of the mRNA and^^^^;^^ripl^l«^^ signal. The 
terminator sequence preferably contains one or more transcriptioIIdtS^SlSat 
sequences (such as polyadenylation sequences) and may also be lengthened by the 
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inclusion of additional DNA sequencelso as to further disrupt transcriptional read- 
through. Preferred terminator sequences (or termination sites) of the present invention 
have a gene that is followed by a transcription termination sequence, either its own 
termination sequence or a heterologous Ibrmination sequence. Examples of such 
termination sequences include stop codons coupled to various polyadenylation 
sequences that are known in the art, widely available, and exemplified below. Where 
the terminator comprises a gene, it can be mvantageous to use a gene which encodes a 
detectable or selectable marker; thereby providing a means by which the presence 
and/or absence of the terminator sequence (aftd therefore the corresponding inactivation 
and/or activation of the transcription unit) cambe detected and/or selected. 

Li addition to the above-described elements, the vectors may contain a selectable 
marker (for example, a gene encoding a protein necessary for the survival or growth of a 
host cell transformed with the vector), although such a marker gene can be carried on 
another polynucleotide sequence co-introduced into the host cell. Only those host cells 
into which a selectable gene has been introduced will survive and/or grow under 
selective conditions. Typical selection genes encode protein(s) that (a) confer resistance 
to antibiotics or other toxins, e.g., ampicillin, neomycyin, G418, methotrexate, etc; (b) 
complement auxotrophic deficiencies; or (c) supply critical nutrients not available from 
complex media. The choice of the proper marker gene will depend on the host cell, and 
appropriate genes for different hosts are known in the art. 
169 In a preferred embodiment, the vector is a shuttle vector, capable of replicating 

in at least two unrelated expression systems. Li order to facilitate such replication, the 
vector generally contains at least two origins of replication, one effective in each 
expression system. Typically, shuttle vectors are capable of replicating in a eukaryotic 
expression system and a prokaryotic expression system. This enables detection of 
protein expression in the eukaryotic host (the expression cell type) and amplification of 
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fte vector in U,e prokaryotic hos, (the amplification cell type). Preferably, one origin of 
rephcatton is derived fro™ SV40 and one is derived fron, pBR322 although ar,y suitable 
ongtn taown in the art may be used provided it directs replication of the vector Where 
*e vector is a shuttle vector, Ute vector ptefe^bly contains at least two selectable 
markers, one for the expression cell type »d one for the antpliflcation cell type Any 
selo^able marker known in me ar, or those described herein may be used provided it 
functions m the expression system being utilized 

The vectors embodied in this invention can be obtamed using recombinant 

cloumgmethods .nd/or by chemical synthesis. Avastnumber of recombinantcloning 
.echniciues such as PCR. restriction endonuclease digestion ^d ligation are weU lotown 
.n the a,., and need not be described in detail herein. One of skill in a,e an can also use 
.he sequence data provided herein or that in the public or proprietaty databases to obtain 
a desired vector by any synthetic means available in the art. 

Host Celk nf the Pr-;,.... ■■...-I^n- 

The invention provides host cells transfect^J with the vectot. or a library of the 
expression vecto,. described above. Tie expression vectors can be introduced into a 

-table ptokaryotic or eukaryoric cell by any ofanumberofappropriatemeans 
mcludtng electtoporation, mic^projectile bombardment, lipofection. infection (whet. 

vector is coupled to an infectious agent), transfecrion employing calcium chloride 
rubtdtum chloride, calcium phosphate, DEAE-dextran. or other substances. The choice 
of the means for inttoducing vectors will often depend on features of the hos, cell. 

For most ammal cells, any of the above-mentioned methyls is suitable for vector 
dehvery. Preferred animal cells are vertebrate cells, preferably mammalian cells 
capable of expressing exogenously intt^uced gene products in large quantity, e.g at the 
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milligram level. Non-limiting examples of preferred cells are NIH3T3 cells, COS, 
HeLa, and CHO cells. 

The animal cells can be cultured in a variety of media. Commercially available 
media such as Ham's FIO (Sigma), Minimal Essential Medium (MEM, Sigma), RPMI- 
1640 (Sigma), and Dulbecco's Modified Eagle's Medium (DMEM, Sigma) are suitable 
for culturing the host cells. In addition, animal cells can be grown in a defined medium 
that lacks serum but is supplemented with hormones, growth factors or any other factors 
necessary for the survival and/or growth of a particular cell type. Whereas a defined 
medium supporting cell survival maintains the viability, morphology, capacity to 
metabolize and potentially, capacity of the cell to differentiate, a defined medium 
promoting cell growth provides all chemicals necessary for cell proliferation or 
multiplication. The general parameters governing mammalian cell survival and growth 
in vitro are well established in the art. Physicochemical parameters which may be 
controlled in different cell culture systems are. e.g., pH, p02, temperature, and 
osmolarity. The nutritional requirements of cells are usually provided in standard media 
formulations developed to provide an optimal environment. Nutrients can be divided 
into several categories: amino acids and their derivatives, carbohydrates, sugare, fatty 
acids, complex lipids, nucleic acid derivatives and vitamins. Apart fi-om nutrients for 
maintaining cell metabolism, most cells also require one or more hormones from at least 
one of the following groups: steroids, prostaglandins, growth factors, pituitary 
hormones, and peptide hormones to proliferate in serum-free media (Sato, G.H., et al. in 
"Growth of Cells in Hormonally Defined Media", Cold Spring Harbor Press, N.Y., 
1982). In addition to hormones, cells may require transport proteins such as fransfeirin 
(plasma iron transport protein), ceruloplasmin (a copper transport protein), and high- 
density lipoprotein (a lipid carrier) for survival and growth in vitro. The set of optimal 
hormones or transport proteins will vary for each cell type. Most of these hoimones or 
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transport proteins have been added exogenously or, in a rare case, a mutant cell line has 
been found which does not require a particular factor. Those skilled in the art will know 
of other factors required for maintaining a cell culture without undue experimentation. 

For plant cells, a variety of vector dehvery techniques is available in the art. The 
host cells may be in the form of whole plants, isolated cells or protoplasts. Illustrative 
procedures for introducing vectors into plant cells include Agrobacterium mediated 
plant transforation, protoplast transformation, gene transfer into pollen, injection into 
reproductive organs and injection into immature embryos. As is evident to one skilled 
in the art, each of these methods has distinct advantages and disadvantages. Thus, one 
particular method of introducing vectors into a particular plant species may not 
necessarily be the most effective for another plant species. 

Agrobacterium tumefaciens-mediated transfer is a widely applicable system for 
introducing vectors into plant cells because the vector can be introduced into whole 
plant tissues, bypassing the need for regeneration of an intact plant from a protoplast. 
The use of Agrobacterium-mediated expression vectors to introduce vector into plant 
cells is well known in the art. This technique makes use of a common feature of 
Agrobacterium which colonizes plants by transferring a portion of their DNA (the T- 
DNA) into a host cell, where it becomes integrated into nuclear DNA. The T-DNA is 
defined by border sequences which are 25 base pairs long, and any DNA between these 
border sequences is transferred to the plant cells as well. The insertion of a recombinant 
plant viral nucleic acid between the T-DNA border sequences results in transfer of the 
recombinant plant viral nucleic acid to the plant cells, where the recombinant plant viral 
nucleic acid is replicated, and then spreads systemically through the plant. 

Because not all plants are natural hosts for Agrobacterium, alternative methods 
such as transformation of protoplasts may be employed to introduce the subject vectors 
into the host cells. For certain monocots, transformation of the plant protoplasts can be 
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achieved using methods based on calcium phosphate precipitation, polyethylene glycol 
treatment, electroporation, and combinations of these treatments. 

hi addition to protoplast transformation, particle bombardment is an alternative 
and convenient technique for delivering the invention vectors into a plant host cell. 
Specifically, the plant cells may be bombarded with microparticles coated with a 
plurality of the subject vectors. Bombardment with DNA-coated microprojectiles has 
been successfully used to produce stable transfonnants in both plants and animals (see, 
for example, Sanford et al. (1993) Methods in Enzymology, 217:483-509). 
Microparticles suitable for introducing vectors into a plant cell are typically made of 
metal, preferably tungsten or gold. These microparticles are available for example, from 
BioRad (e.g., Bio-Rad's PDS-lOOO/He). Those skilled in the art will know that the 
particle bombardment protocol can be optimized for any plant by varying parameters 
such as He pressure, quantity of coated particles, distance between the macrocairier and 
the stopping screen and flying distance from the stopping screen to the target. 

Vectors can also be introduced into plants by direct DNA fransfer into pollen as 
described by Zhou et al. Methods in Enzymology, 101 :433 (1983); D. Hess. Intern Rev. 
Cytol, 107:367 (1987); Luo et al. Plant Mol. Biol. Reporter, 6:165 (1988). 
Alternatively, the vectors can be injected into reproductive organs of a plant as 
described by Pena et al.. Nature, 325:274 (1987). 

Other techniques for introducing nucleic acids into a plant cell include: 

(a) Hand hioculations. Hand inoculations are performed using a neutral pH, 
low molarity phosphate buffer, with the addition of celite or 
carborundum (usually about 1%). One to four drops of the preparation is 
put onto the upper surface of a leaf and gently rubbed. 

(b) Mechanized Inoculations of Plant Beds. Plant bed inoculations are 
performed by spraying (gas-propelled) the vector solution into a tractor- 
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driven mower while cutting the leaves. Alternatively, the plant bed is 
mowed and the vector solution sprayed immediately onto the cut leaves. 

(c) High Pressure Spray of Single Leaves. Single plant inoculations can also 
be performed by spraying the leaves with a narrow, directed spray (50 
psi, 6-12 inches from the leaf) containing approximately 1% 
carborundum in the buffered vector solution. 

(d) Vacuum Infiltration. Inoculations may be accomplished by subjecting a 
host organism to a substantially vacuum pressure environment in order to 
facilitate infection. 

Other suitable host cells for cloning and expressing the subject vectors are 
prokaryotes and eukaryotic microbes such as fungi or yeast cells. Suitable prokaryotes 
for this purpose include bacteria including Gram-negative and Gram-positive organisms. 
Representative members of this class of microorganisms are Enterobacteriaceae (e.g^. 
CO/0, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella (e.g. Sahnonella 
typhimurium), Seiratia (e.g., Serratia marcescans). Shigella, Neisseria (e.g. Neisseria 
meningitidis) as well as Bacilli (e.g. Bacilli subtilis and Bacilli Hcheniformis). 
Preferably, the host cell secretes minimal amounts of proteolytic fragments of the 
expressed Abus. Commonly employed fimgi (including yeast) host cells are S. 
cerevmae. Kluyveromyces lactis (K. lactis), species of Candida including C. albicans 
and C. glabrata, C. maltosa. C. utUis. C. stellatoidea. C. parapsilosis, C. tropicalus, 
Neurospora crassas. Aspergillus nidulans. Schizosaccharomyces pombe (S. pombe), 
Pichiapastoris, and Yarowia lipolytica. 

Once introduced into a suitable host cell, expression of the Abus can be 
determined using any nucleic acid or protein assay known in the art. For example, the 
presence of transcribed mRNA of L or H chain, or the Sc Abu can be detected and/or 
quantified by conventional hybridization assays (e.g. Northern blot analysis), 
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amplification procedures (e.g. RT-PCR), SAGE (U.S. Patent No. 5,695,937), and array- 
based technologies (see e.g. U.S. Pat. Nos. 5,405,783, 5,412,087 and 5,445,934), using 
probes complementary to any region of Abu polynucleotide. 

Expression of the vector can also be determined by examining the Abu 
expressed. A variety of techniques are available in the art for protein analysis. They 
include but are not limited to radioimmunoassays, ELISA (enzyme linked 
immunoradipmetric assays), "sandwich" immunoassays, immunoradiometric assays, in 
situ immunoassays (using e.g., colloidal gold, enzyme or radioisotope labels), western 
blot analysis, immunoprecipitation assays, immunoflourescent assays, and PAGE-SDS. 

Uses of the polynucleoti des, vectors and host cells of the present invention: 

The polynucleotides and vectors of this invention have several specific uses. 
They are useful, for example, in expression systems for the production of both Sc and 
Nsc Abus. The polynucleotides are useful as primers to effect amplification of desired 
polynucleotides. Furthermore, The polynucleotides of this invention are also useful in 
pharmaceutical compositions including vaccines, diagnostics, and drugs. 

The host cells of this invention can be used, inter alia, as repositories of the 
subject polynucleotides, vectors, or as vehicles for producing and screening desired 
Abus based on their antigen binding specificities. 

Accordingly, the invention provides a method of identifying a Nsc Abu that is 
immunoreactive with a desired antigen. The method involves the following steps: (a) 
preparing a genetically diverse repertoire of Abus, wherein the repertoire comprises at 
least one subject Abu; (b) contacting the repertoire of antigen binding units with the 
desired antigen; (c) detecting a specific binding between Abus and the antigen, thereby 
identifying the Abu that is immunoreactive with the desired antigen. 
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The ability of an Abu to specifically bind to a desired antigen can be tested by a 
variety of procedures well established in the art. See Harlow and Lane (1988) 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, New York; 
Gherardi et al (1990) J, Immunol Meth, 126:61-68. Typically, Abus exhibiting desired 
binding specificities can be detected directly by immunoassays, for example, by reacting 
labeled Abus with the antigens that are immobilized on a solid support or substrate. In 
general, the substrate to which the antigen is adhered is fabricated with material 
exhibiting a low level of non-specific binding during immunoassay. A preferred solid 
support is made fi-om one or more of the following types of materials: plastic polymers, 
glass, cellulose, nitrocellulose, semi-conducting material, and metal. Preferably, the 
substrate is petri dish, chromatography beads, magnetic beads, and the like. 

For such solid-phase assay, the unreacted Abus are removed by washing. In a 
Hquid-phase assay, however, the unreacted Abus are removed by some other separation 
technique, such as filtration or chromatography. After binding the antigen to the labeled 
Abus, the amount of bound label is determined. A variation of this technique is a 
competitive assay, in which the antigen is bound to saturation with an original binding 
molecule. When a population of the subject Abu is introduced to the complex, only 
those that exhibit higher binding affinity will be able to compete, and thus remain bound 
to the antigen. 

Alternatively, specific binding to a given antigen can be assessed by cell sorting, 
which involves presenting the desired antigen on the cells to be sorted, then labeling the 
target cells with Abus that are coupled to detectable agents, followed by separating the 
labeled cells fi-om the unlabeled ones in a cell sorter. A sophisticated cell separation 
method is fluorescence-activated cell sorting (FACS). Cells traveling in single file in a 
fine stream are passed through a laser beam, and the fluorescence of each cell bound by 
the fluorescently labeled Abu is then measured. 
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Subsequent analysis of the eluted Abus may involve protein sequencing for 
delineating the amino acid sequences of the L and H chains. Based on the deduced 
amino acid sequences, the cDNA encoding the antibody polypeptides can then be 
obtained by recombinant cloning methods including PGR, Ubrary screening, homology 
searches in existing nucleic acid databases, or any combination thereof. Commonly 
employed databases include but are not limited to GenBank, EMBL, DDBJ, PDB, 
SWISS-PROT, EST, STS, GSS, and HTGS. 

When the repertoire of Abu is displayed on phage or bacterial particles, selection 
is preferably performed using affinity chromatography. The method typically proceeds 
with binding a repertoire of phage Abus to an antigen coated plates, colunm matrices, 
cells or to biotinylated antigen in solution followed by capture. The phages or bacteria 
bound to the solid phase are washed and then eluted by soluble hapten, acid or alkali. 
Alternatively, increasing concentrations of antigen can be used to dissociate the Abus 
fi-om the affinity matrix. For certain Abus with extremely high affinity or avidity to the 
antigen, efficient elution may require high pH or mild reducing solution as described in 
WO 92/01047. 

To avoid potential difficulties in recovering the bound Abus with the desired 
binding specificities, protease cleavage sites may be introduced between the 
heterodimerization sequences and the phage coat protein employed for displaying the 
Abus. Cleavage sites applicable for this purpose include but are not limited to Factor X, 
trypsin, and thrombin recognition sites. After binding the phage repertoire to an affinity 
matrix and washing the non-specific phages, the remaining phages that display the Abus 
with the desired affinity can be collected by washing the antigen-affinity matrix with 
protease under conditions suitable for digestion at the cleavage site. Such digestion 
would release the Abus firom the phage particles. 
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An alternative procedure to the above is to take the affinity matrix which has 
retained the strongly bound phage or bacterial particles and extract their nucleic acids, 
for example by boiling in SDS solution. Extracted nucleic acids can be used to directly 
transform E. coli host cells or ahematively the antibody encoding sequences can be 
amplified by PGR using suitable primers. 

The efficiency of selection is likely to depend on a combination of several 
factors, including the kinetics of dissociation during washing, and whether multiple 
Abus on a single phage or bacterium can simultaneously bind to antigens on a solid 
support. For example, antibodies with fast dissociation kinetics (and weak binding 
affinities) should be retained by use of short washes, multivalent display and a high 
coating density of antigen at the solid support. Conversely, the selection of Abus with 
slow dissociation kinetics (and good binding affinities) should be favored by use of long 
washes, monovalent phages, and a low coating density of antigen. 

Where desired, the repertoire of Abus can be pre-selected against an unrelated 
antigen to counter-select the undesired Abus. The repertoire may also be pre-selected 
against a related antigen in order to isolate, for example, anti-idiotypic Abus. 

The subject Abu repertoire enables rapid isolation of Abus with desired 
specificities. Many of the isolated Abus would be expected to be difficult or impossible 
to obtain through conventional hybridoma or transgenic animal technology. 

Kits comprising the v ectors of the present invention 

The present invention also encompasses kits containing the vectors of this 
invention in suitable packaging. Kits embodied by this invention include those that 
allow generation of Abus reconstituted via pairwise affinity of a unique 
heterodimerization sequence pair as described herein. 
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Each kit necessarily comprises the reagents which render the deUvery of vectors 
into a host cell possible. The selection of reagents that facilitate delivery of the vectors 
may vary depending on the particular transfection or infection method used. The kits 
may also contain reagents useful for generating labeled polynucleotide probes or 
proteinaceous probes for detection of Abus. Each reagent can be supplied in a solid 
form or dissolved/suspended in a liquid buffer suitable for inventory storage, and later 
for exchange or addition into the reaction medium when the experiment is performed. 
Suitable packaging is provided. The kit can optionally provide additional components 
that are useful in the procedure. These optional components include, but are not limited 
to, buffers, capture reagents, developing reagents, labels, reacting surfaces, means for 
detection, control samples, instructions, and interpretive information. 

Further illustration of the development and use of Abus, polynucleotides, vectors 
and host cells according to this invention are provided in the Example section below. 
The examples are provided as a guide to a practitioner of ordinary skill in the art, and 
are not meant to be limiting in any way. 
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EXAMPLES 

Construction of Non-Single-Chain Antigen-Binding Units: Coiled-Coil Fv (ccFv) 

As described above, the Fv fragment is the smallest antibody fragment 
containing the whole antigen-binding site. Composed of the two variable regions of 
heavy and light chain (VH and VL), Fv is located at the "upper" tips of the Y-shaped 
immunoglobulin molecule. The Fv fragments have very low interaction energy between 
their two VH and VL fragments, and are often too unstable for many applications at 
physiological condition. Li a naturally occurring immunoglobulin (e.g. Ig), an 
interchain disulfide bond located in the constant domains CHI and CL is used to link 
the VH and VL. This linkage makes a stabilized antigen-binding fragment Fab with a 
molecular weight 50 kDa. It has been shown that the VH and VL fragments can also be 
artificially held together by a short peptide linker between the carboxyl terminus of one 
fragment and amino-terminus of another to form a single-chain Fv antibody fragment 
(scFv). The scFv antigen-binding unit is only half the size of Fab. However, some scFv 
proteins are also unstable. The polypeptide linker in scFv can interfere with binding in 
some cases. An interchain disulfide bond has also been introduced into the framework 
regions in VH and VL to form a disulfide-stabilized Fv (dsFv). The dsFv configuration 
also has profound limitations. The introduction of two Cys residues into the antigen- 
binding variable regions may change the intrachain disulfide bond in VH or VL, 
therefore interfere with antigen binding. 

We have devised a new strategy to stabilize VH and VL heterodimer. We have 
designed and used a unique heterodimerization sequence pair to create a Fab-like, 
fimctional artificial Fv fragment (ccFv). The heterodimerization pair were derived from 
the heterodimeric receptors GABAb receptors 1 and 2. The pair of sequences form a 
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coiled-coil structure and mediate the functional heterodimerization of GABAb-R1 and 
GABAb-R2 receptors. 

Disti^iguished from previously characterized coiled-coil leucine zippers from the 
Fos andJun protems, the C-terminal coiled coil of GABAb-R1 and GABAb-R2 
receptors do not fomihcmodimers under physiological conditions (e.g. in vivo); nor do 
they form homodimers at plW)logical body temperatures. Research by Kuner et al. 
and White et al. {Science (1999) 2«^: 74-77); Nature (1998) 396: 679-682)) have 
demonstrated the heterodimerization sp^flcity of GABAb-R1 and GABAb-R2 in vivo. 
In fact, White et al. were able to clone GAbX-R2 from yeast cells based on the 
exclusive specificity of this heterodimeric receptoWir. In vitro studies by Kammerer 
et al. supra has shown that neither GABAb-R1 nor GARAb-R2 C-teminal sequences is 
capable of forming homodimers in physiological buffer conditions when assayed at 
physiological body temperatures. However, none of these reseafbhes who were 
involved in the original isolation of the GABAb-R2 gene and the cha^cterization of the 
coiled-coil sequences describe or even suggest the use of this unique hetehxiimerization 
sequences for construction of heteromultimers such as antigen-binding units. 

/^*^modified the carboxyl terminus of GRl and GR2 domains by adding a 
flexon "SerArgGlyl3l5(^Gly" to the amino-terminus of GRl and GR2 domains to 
provide additional flexibilit>^t€^e V regions. To fiirther stabilize ccFv, we have 
infroduced a pair of cysteine residues1>5(^dding "ValGlyGlyCys" spacer at the C- 
termini of the coiled coil. The GRl and GR^kcnains are fiised to the carboxyl terminus 
of VH and VL fragment respectively. TheVH-GRl^VL-GR2 fiisions were 
expressed in E. coli and displayed by phage. As shown in Figmres 10-11, fimctional 
heterodimeric ccFv Abus stabilized by the parallel coiled-coil hdiWere generated. 
Since the coiled-coil heterodimerization sequences are about half the sizb^of CHI and 
CL domains, the ccFv (approximately 35 kDa) is smaller than the convention^ Fab 
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fragments (apph>ximately 50 kDa). Because of the small size, the ccFvs and its 
derivatives are potentiallylnore useful for clinical applications such as tumor and tissue 
penetration. More efficient expressioirand^splay of ccFv is expected. Furthermore, 
the specific assembly of VH and VL regions due tothe^p^mvise affinity of the unique 
heterodimerization sequences makes the construction of a robust, vast diverse repertoire 
of Abs more feasible. 



Materials and Methods: 
203 Bacterial and phage strain: Escherichia coli TGI ( supE A(hsdM-mcrB)5(rkmk 

McrB')thi A(lac-proAB /F' traD36, LacP A(lacZ)M15] was used for plasmid DNA and 
phage production; K07 helper phage and HRP-conjugated anti-M13 antibody from 
Phamersham Pharmacia Biotech; pbluescript SK(+) from Stratagene; Anti-HA antibody 
from Santa Cruz Biotechnology. 



Example 1 : Vector construction 
pABMXl and pABMX2 Vectors: 

204 The phagemid display vector pABMXl and pABMX2 were derived from 

pbluescript SK(+). A unique Agel restriction site was introduced immediately after lac 
promoter by PCR-based site-directed mutagenesis with a set of primer (pBS-SKa: 
5'GGAATTGTGAGCGGATAACAATTTACCGGTCACACAGGAAACAGCTATGA 
CCATG-3' and pBS-SKb 

5'CATGGTCATAGCTGTTTCCTGTGTGACCGGTAAATTGTTATCCGCTCACAA 
TTCC-3'), and the Xhol and Kpnl sites were deleted by cutting and blunt end ligation. 
Afterwards, the synthetic DNA fragments flanked by Agel site at 5 'and Bglll/EcoRI 
sites at 3', containing translation enhance sequence EP from T7 phage gene 10 
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(TTAACTTTA), ribosome binding sequence S/D (TAAGGAGG), fd phage geneS 
leader sequence with Hindlll site 

(ATGAAAAAGTCTTTAGTCCTCAAAGCCTCCGTAGCCGTTGCTCCCTCGTTCC 
GATGCTAAGCTTCGCT, for pABMXl) or pelB leader sequence 
(ATGAAATACCTATTGCCTACGGCAGCCGCTGGATTGTTA 
TTACTCGCGGCCCAGCCGGCCATGGCG, for pABMX2) and HA-(His)6-tag 
(DHtag)(TATCCATACGACGTACCAGACTACGCAGGAGGTCATCACCATCATC 
ACC ATT AG), were cloned into modified pbluescript SK(+). The resulting vectors are 
designated pABMXl and pABMX2 (see Fig. 3A-B for restriction maps and sequences). 
Heterologous sequences encoding heteromultimers such as Nsc Abus were further 
subcloned into these vectors for periplasmic expression. 

pABMDl and pABMD2 vectors: 

A PCR-amplified fd gene III (or gene 3) fragment flanked by Bglll and Sail sites 
was inserted into pABMXl and pABMX2 vectors (see Fig. 4). The heterologous 
sequence to be displayed can be inserted after the leader sequences. The lac promoter 
drives expression of the pIII capsid fusion, which in turn can be displayed on phage 
surface after superinfection by helper phage such as K07. 

PABMX5 and pABMX6 vectors: 

These two vectors were derived from pABMXl and pABMX2. A synthetic 
DNA fragment flanked by Xbal/AscI site at 5'and MluI/XhoI/NotI sites at 3', 
containing the ribosome binding sequence S/D (TAAGGAGG) and Gene 3 leader 
sequence (ATGAAA AAATTATTATTCGCAATTCCTTT AGTTGTTC CTT 
TCTATTCTCACTCCGCT), was inserted into pABMXl and pABMX2 by Xbal/NotI 
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sites. Afterward, the GRl domain coding sequence (Fig. 2) was subcloned into 
Xba/AscI sites, and GR2 domain coding sequence (Fig. 2) was inserted into XhoI/NotI 
site. Then the VH and VL domain were inserted before GRl and GR2 sequence 
respectively. A schematic representation of the vectors pABMXS and pABMX6 are 
shown as Fig. 5A. These vectors express two proteins: VH-GRl and VL-GR2 under 
one lac promoter. 

pABMDS and pABMD6 vector: 

The ccFv DNA fragments from vector pABMXS and pABMX6 vectors were 
subcloning into pABMDl and pABMD2 to yield vectors pABMDS and pABMD6 (see 
Fig. 6A-B for restriction maps and sequences). These vectors express two proteins: VH- 
GRl and VL-GR2-pIII fusions. The expressed VH-GRl and VL-GR2-pIII fiisions are 
secreted into periplasmic space, where dimerization may take place via the coiled-coil 
domain heterodimerization. The assembled Abu is then displayed on phage surface 
upon superinfection of helper phages such as K07. 

Example 2: Expression of Functional ccFv 

Antibody variable domains from antibody AMI were subcloned into pABMX6 
vector to expression ccFv fragment. The vector was then infroduced into either TGI 
cells or BL21 cells. The transformed bacteria in 500 ml 2xYT containing approximately 
100 ug/ml carbenicillin and 0.1% glucose from a single colony was grew to ODeoo = 0.7 
(approximately), at 37°C. 1 mM of IPTG was added for 4 hrs inductions at 30°C. The 
bacterial pellet was collected for periplasmic and osmotic shock preparation. The pellet 
was resuspended in 12.5 ml PPB buffer (200 mg/ml sucrose, ImM EDTA, 30 mM Tris- 
HCl, pH 8.0) with 1.25 ml protease inhibitor cocktail from Sigma, and put on ice for 20 
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min. The supernatant was collected by spinning. The pellet was resuspended in 5 mM 
MgS04, and incubated on ice for 20 min. The MgS04 and PPB supematants were 
combined, and dialyzed against PBS. After loading to a 1ml of Ni-NTA column, the 
His-tag proteins were purified by 350 mM imidozole elution. Figure lOA shows that 
the purified ccFv has an electrophoretic mobility of 35 kDa on a non-reducing gel. 
When analyzed under reducing condition, two subunits corresponding to VL and VH 
were observed. The upper band was confirmed as VL-His-tag fusion by Western blot 
analysis. 

To measure the binding specificity of soluble AMl-ccFv, ELISA assay was 
carried out. The AMI antigens (0.2- lug/well) were coated on ELISA plates for 
overnight at 4 °C. After 5% milk/PBS blocking, antibody solution in 5% milk/PBS was 
added to the ELISA plate, and incubated for 1-2 hr at room temperature. The unbound 
Abus were washed out. Figure lOB shows specific binding of AMl-ccFv to its antigen. 
The control contains 5% milk in PBS. This result confirms the assembly of functional 
ccFv by the coiled-coil GABAb R1/R2 heterodimerization sequences. 

Example 3: Display of Functional ccFv 

Display of antibody by a genetic package is a powerful tool to enrich and isolate 
specific Abu from large libraries. To analyze whether ccFv can be utilized in a phage 
display system, we have constructed a phagemid vector by subcloning ccFv gene of 
AMI antibody into pABMD6 vector. TGI cells carrying the phagemid vectors were 
superinfected by K07 helper phage. The infected TGI cells were grown in 
2xYT/Amp/Kan at 30 overnight. The phagemid particles were precipitated by 
PEG/NaCl from culture supematants twice, and resuspended in PBS. The antibody 
displayed on phage was detected by antigen binding activity via phage ELISA assay. 
Briefly, the antigens were first coated on ELISA plates. After 5% milk/PBS blocking. 
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the phage solution was added to ELISA plates. The phage bound to antigen was detected 
by incubation with HRP-conjugated anti-M13 antibody. The substrate ABTS 
[2,2'Azino-bis(3-ethylbenzthiazoline-6-sulfonic acid)] was used for measurement of 
HRP activity. The anti-HA tag antibody was also used to detect the proteins displayed 
on phage particle. The anti-HA antibody was coated on 96-well plates (2 ug/each well). 
The phages bound to anti-HA antibody coated on ELISA plate were detected by HRP- 
conjugated anti-M13 antibody. 

The single-chain antibody phage was also prepared for comparing the ccFv and 
scFv phage-display. As shown in Figure 1 lA-B, the binding capability of ccFv phages 
is comparable to that of the conventional scFv phages. For certain ccFv expressing 
phages, their binding capability is ahnost one order of magnitude higher than phages 
expressing conventional scFv (Figure 1 IB). Thus, ccFv is a functional Abu even when 
displayed on a phage particle. 

Expression of Single-Chain Antigen-Binding Units: 

Example 4: Expression of conventional scFv 

The AM2-scFv was subcloning into soluble expression vector pABMXl at the 
Hindni/NotI sites. The periplasmic preparation was carried out as outlined above. A 30 
kDa antibody protein purified fi:om NI-NTA column was confirmed by SDS-PAGE 
analysis, and was tested for its antigen-binding specificity using ELISA. The AM2 
antigens were first coated onto ELISA plates at a concentration of 0.2 ug/well. 
Different amounts of AM2-scFv fi-agments were incubated with the antigen. Bound 
AM2-svFv fragments were detected by the anti-HA-tag antibody. The experiment 
revealed a dose-dependent binding of AM2-scFv to its the AM2 antigen (Figure 8). 

Example 5: Display of conventional scFv on phage 
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The AM2-scFv fragment was first subcloning into phagemid vector pABMDl at 
the Hindlll/Notl sites. TGI cells carrying this phagemid vector were infected by the 
helper phage K07. The phages were purified firom the supematants. Phage ELISA 
assay was subsequently performed to detect AM2-scFv displayed on the phage particles. 
Because the coat pIU gene is tagged with HA-tag, the fiision can be detected with anti- 
HA antibodies. ELISA assay using AM2 antigen and anti-HA antibodies confirmed that 
the displayed scFv was capable of specifically binding to the corresponding antigens 
(Figure 9). The control involves phages displaying unrelated antibodies that are not HA 
tagged. 

Expression of the Subject Antigen-Binding Units in Eukaryotic Cells: 

Exam ple 6: Kx pression o f ccFv in yeast 

The yeast vector pAMEX7 carrying both VL and VH sequences that are linked 
to the subject heterodimerization sequences is constructed. Competent yeast cells, e.g. 
AH109 cell, are prepared and transformed with pAMEX7 vectors according to any 
method known in the art. The transformed yeast cells are cultured under conditions 
suitable for protein expression. Such conditions are well known to artisans in the field 
and hence are not detailed herein. The expressed ccFv Abus are harvested using 
conventional methods known in the art and/or procedures described herein. The antigen 
binding capability of the harvest ccFv are determined by ELISA according to the 
protocols described above. 
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